44) /Si 19 

wo 2004,042346 ReCd PCWW 19 OCT 2004 *™ sMr ° 12946 

MRTflggg Alvm-f^MPn^iTinivs fob ni ahnqsing 
^AgtD4V#ONiTOTONG^fR AN S P L Ai VT REJECTION 

Related Applications 

This application claims priority to U.S. Application No. 10/131,831, filed April 24, 2002, and 
U.S. Application No. 10/325,899, filed December 20, 2002. 
Field of the Invention 

This invention is in the field of expression profiling following organ transplantation. 
Background of the Invention 

Many of the current shortcomings in diagnosis, prognosis, risk stratification and treatment of 
disease can be approached through the identification of the molecular mechanisms underlying a disease 
and through the discovery of nucleotide sequences (or sets of nucleotide sequences) whose expression 
patterns predict the occurrence or progression of disease states, or predict a patient's response to a 
particular therapeutic intervention. In particular, identification of nucleotide sequences and sets of 
nucleotide sequences with such predictive value from cells and tissues that are readily accessible would 
be extremely valuable. For example, peripheral blood is attainable from all patients and can easily be 
obtained at multiple time points at low cost. This is a desirable contrast to most other cell and tissue 
types, which are less readily accessible, or accessible only through invasive and aversive procedures. 
In addition, the various cell types present in circulating blood are ideal for expression profiling 
experiments as the many cell types in the blood specimen can be easily separated if desired prior to 
analysis of gene expression. While blood provides a very attractive substrate for the study of diseases 
using expression profiling techniques, and for the development of diagnostic technologies and the 
identification of therapeutic targets, the value of expression profiling in blood samples rests on the 
degree to which changes in gene expression in these cell types are associated with a predisposition to, 
and pathogenesis and progression of a disease. 

Hematopoiesis is the development and maturation of all cell types of the blood. These include 
erythrocytes, platelets and leukocytes. Leukocytes are further subdivided into granulocytes 
(neutrophils, eosinophils, basophils) and mononuclear cells (monocytes, lymphocytes). These cells 
develop and mature from precursor cells to replenish the circulating pool and to respond to insults and 
challenges to the system. This occurs in the bone marrow, spleen, thymus, liver, lymph nodes, mucosal 
associated lymphoid tissue (MALT) and peripheral blood. 

Precursor cells differentiate into immature forms of each lineage and these immature cells 
develop further into mature cells. This process occurs under the influence and direction of 
hematopoietic growth factors. When hematopoiesis is stimulated, there is an increase in the number of 
immature cells in the peripheral blood and in some cases, precursor cells are found at increased 
frequency. For example, CD34+ cells (hematopoietic stem cells) may increase in frequency in the 
peripheral blood with an insult to the immune system For neutrophils, "band" forms are increased, 
for erythrocytes, reticulocytes or nucleated red cells are seen. Lymphocytes are preceeded by 
lymphoblasts (immature lymphocytes). 

It may be an important clinical goal to measure the rate of production of blood cells of a 
variety of lineages. Hematological disorders involving over or under production of various blood cells 
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may be treated pharmacologically. For example, anemia (low red blood cells) may be treated with 
erythropoietin (a hematopoietic growth factor) and response to this therapy can be assessed by 
measuring RBC production rates. Low neutrophils counts can be treated by administration of G-CSF 
and this therapy may be monitored by measuring neutrophil production rates. Alternatively, the 
diagnosis of blood cell disorders is greatly facilitated by determination of lineage specific production 
rates. For example, anemia (low RBCs) may be caused by decreased cellular production or increased 
destruction of cells. In the latter case, the rate of cellular production will be increased rather than 
decreased and the therapeutic implications are very different. Further discussion of the clinical uses of 
measures of blood cell production rates is given in below. 

Assessment of blood cell production rates may be useful for diagnosis and management of 
non-hematological disorders. In particular, acute allograft rejection diagnosis and monitoring may 
benefit from such an approach. Current diagnosis and monitoring of acute allograft rejection is 
achieved through invasive allograft biopsy and assessment of the biopsy histology. This approach is 
sub-optimal because of expense of the procedure, cost, pain and discomfort of the patient, the need for 
trained physician operators, the risk of complications of the procedure, the lack of insight into the 
functioning of the immune system and variability of pathological assessment. In addition, biopsy can 
diagnose acute allograft rejection only after significant cellular infiltration into the allograft has 
occurred. At this point, the process has already caused damage to the allograft. For all these reasons, a 
simple blood test that can diagnose and monitor acute rejection at an earlier stage in the process is 
needed. Allograft rejection depends on the presence of functioning cells of the immune system. In 
addition, the process of rejection may cause activation of hematopoiesis. Finally, effective 
immunosuppressive therapy to treat or prevent acute rejection may suppress hematopoiesis. For these 
reasons, assessment of hematopoietic cellular production rates may be useful in the diagnosis and 
monitoring of acute rejection. 

Current techniques for measuring cellular development and production rates are inadequate. 
The most common approach is to measure the number of mature cells of a lineage of interest over time. 
For example, if a patient is being treated for anemia (low red blood cell counts), then the physician will 
order a blood cell count to assess the number of red blood cells (RBCs) in circulation. For this to be 
effective, the physician must measure the cell count over time and may have to wait 2-4 weeks before 
being able to assess response to therapy. The same limitation is true for assessment of any cell lineage 
in the blood. 

An alternative approach is to count the number of immature cells in the peripheral blood by 
counting them under the microscope. This may allow a more rapid assessment of cellular production 
rates, but is limited by the need for assessment by a skilled hematologist, observer variability and the 
inability to distinguish all precursor cells on the basis of morphology alone. 

Bone marrow biopsy is the gold standard for assessment of cellular production rates. In 
addition to the limitations of the need for skilled physicians, reader variability and the lack of 
sensitivity of morphology alone, the technique is also limited by the expense, discomfort to the patient 
and need for a prolonged visit to a medical center. Thus there is a need for a reliable, rapid means for 
measuring the rate of hematopoeisis in a patient. 
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In addition to the relationship between hematopoiesis and variety of disease processes, there is 
an extensive literature supporting the role of leukocytes, e.g., T-and B-lymphocytes, monocytes and 
granulocytes, including neutrophils, in a wide range of disease processes, including such broad classes 
as cardiovascular diseases, inflammatory, autoimmune and rheumatic diseases, infectious diseases, 
transplant rejection, cancer and malignancy, and endocrine diseases. For example, among 
cardiovascular diseases, such commonly occurring diseases as atherosclerosis, restenosis, transplant 
vasculopathy and acute coronary syndromes all demonstrate significant T cell involvement (Smith- 
Norowitz et al. (1999) Clin Immunol 93:168-175; Jude et al. (1994) Circulation 90:1662-8; Belch et al. 
(1997) Circulation 95:2027-31). These diseases are now recognized as manifestations of chronic 
inflammatory disorders resulting from an ongoing response to an injury process in the arterial tree 
(Ross et al. (1999) Ann Thorac Surg 67: 1428-33). Differential expression of lymphocyte, monocyte 
and neutrophil genes and their products has been demonstrated clearly in the literature. Particularly 
interesting are examples of differential expression in circulating cells of the immune system that 
demonstrate specificity for a particular disease, such as arteriosclerosis, as opposed to a generalized 
association with other inflammatory diseases, or for example, with unstable angina rather than 
quiescent coronary disease. 

A number of individual genes, e.g., CD1 lb/CD18 (Kassirer et al. (1999) Am Heart J 138:555- 
9); leukocyte elastase (Amaro et al. (1995) Eur Heart J 16:615-22; and CD40L (Aukrust et al. (1999) 
Circulation 100:614-20) demonstrate some degree of sensitivity and specificity as markers of various 
vascular diseases. In addition, the identification of differentially expressed target and fingerprint genes 
isolated from purified populations of monocytes manipulated in various in vitro paradigms has been 
proposed for the diagnosis and monitoring of a range of cardiovascular diseases, see, e.g., US Patents 
Numbers 6,048,709; 6,087,477; 6,099,823; and 6,124,433 "COMPOSITIONS AND METHODS FOR 
THE TREATMENT AND DIAGNOSIS OF CARDIOVASCULAR DISEASE" to Falb (see also, WO 
97/30065). Lockhart, in US Patent Number 6,033,860 "EXPRESSION PROFILES IN ADULT AND 
FETAL ORGANS" proposes the use of expression profiles for a subset of identified genes in the 
identification of tissue samples, and the monitoring of drug effects. 

The accuracy of technologies based on expression profiling for the diagnosis, prognosis, and 
monitoring of disease would be dramatically increased if numerous differentially expressed nucleotide 
sequences, each with a measure of specificity for a disease in question, could be identified and assayed 
in a concerted manner. PCT application WO 02/057414 "LEUKOCYTE EXPRESSION PROFILING" 
to Wohlgemuth identifies one such set of differentially expressed nucleotides. 

In order to achieve this improved accuracy, the sets of nucleotide sequences once identified 
need to be validated to identify those differentially expressed nucleotides within a given set that are 
most useful for diagnosis, prognosis, and monitoring of disease. The present invention addresses these 
and other needs, and applies to transplant rejection and detection of the rate of hematopoeisis for which 
differential regulation of genes, or other nucleotide sequences, of peripheral blood can be 
demonstrated. 
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Summary of the Invention 

In order to meet these needs, the present invention is thus directed to a system for detecting 
differential gene expression. In one format, method are provided for assessing the immune status of an 
individual by detecting the expression level of one or more genes expressed at different levels 
depending upon the rate of hematopoiesis or the distribution of hematopoietic cells along their 
maturation pathway in the individual. The one or more genes may include a nucleotide selected from a 
nucleotide sequence selected from SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ 
ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID 
NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID 
NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID 
NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID 
NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID 
NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID 
NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID 
NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID 
NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID 
NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID 
NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID 
NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID 
NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID 
NO:90, SEQ ID NO.91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID 
NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID 
NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ 
ID NO: 108, SEQ ID NO: 109, SEQ ID NO:l 10, SEQ ID NO: 1 1 1, SEQ ID NO: 1 12, SEQ ID NO:l 13, 
SEQ ID NO: 1 14, SEQ ID NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, SEQ ID 
NO:l 19, SEQ ID NO: 120, SEQ ID NO: 121, SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ 
ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, 
SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID 
NO:136, SEQ IDNO:137, SEQ EDNO:138, SEQ ID NO:139, SEQ IDNO:140, SEQ ID NO:141, SEQ 
ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, 
SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID 
NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ 
ID NO: 159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, 
SEQ ID NO: 165, SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO:168, SEQ ID NO: 169, SEQ ID 
NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75 , SEQ 
ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, 
SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID 
NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 92, SEQ 
ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, 
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SEQ ID NO: 199, SEQ ID NO:200, 
NO:204, SEQ ID NO:205, SEQ ID 
ID NO:210, SEQ ID NO:21 1, SEQ 
SEQ ID NO:216, SEQIDNO:217, 
NO:221, SEQ ID NO:222, SEQ ID 
ID NO:227, SEQ ID NO:228, SEQ 
SEQ ID NO:233, SEQ ID NO:234, 
NO:238, SEQ ID NO:239, SEQ ID 
ID NO:244, SEQ ID NO:245, SEQ 
SEQ ID NO:250, SEQ ID NO:251, 
NO:255, SEQ ID NO:256, SEQ ID 
ID NO:261, SEQ ID NO:262, SEQ 
SEQ ID NO:267, SEQ ID NO:268, 
NO:272, SEQ ID NO:273, SEQ ID 
ID NO:278, SEQ ID NO:279, SEQ 
SEQ ID NO:284, SEQ ID NO:285, 
NO:289, SEQ ID NO:290, SEQ ID 
CD NO:295, SEQ ID NO:296, SEQ 
SEQ ID NO:301, SEQ ID NO:302, 
NO:306, SEQ ID NO:307, SEQ ID 
ID NO:312, SEQ ID NO:3I3, SEQ 
SEQ IDNO:318, SEQ IDNO:319, 
NO:323, SEQ ID NO:324, SEQ ID 
ID NO:329, SEQ ID NO:330, SEQ 
NO:2645, 
NO:2667, 
NO:2743, 
NO:2671, 
NO:2677, 
NO:2657, 
NO:2664, 
NO:2713, 
NO:2680, 
NO:2750, 
NO:2647, 
NO:2635, 
NO:2691, 
NO:2631> 
NO:2730, 
NO:2662, 



SEQ ID NO:20t, SEQ ID NO:202, 
NO:206, SEQ ID NO:207, SEQ ID 
ID NO:2 1 2, SEQ ID NO:2 1 3, SEQ 
SEQ ID NO:218, SEQ ID NO:219, 
NO:223, SEQ ID NO:224, SEQ ID 
ID NO:229, SEQ ID NO:230, SEQ 
SEQ ID NO:235, SEQ ID NO:236, 
NO:240, SEQ ID NO:241, SEQ ID 
ID NO:246, SEQ ID NO:247, SEQ 
SEQ ID NO:252, SEQ ID NO:253, 
NO:257, SEQ ID NO:258, SEQ ID 
ID NO:263, SEQ ID NO:264, SEQ 
SEQ ID NO:269, SEQ ID NO:270, 
NO:274, SEQ ID NO:275, SEQ ID 
ID NO:280, SEQ ID NO:281, SEQ 
SEQ ID NO:286, SEQ ID NO:287, 
NO:291, SEQ ID NO:292, SEQ ID 
ID NO:297, SEQ ID NO:298, SEQ 
SEQ ID NO:303, SEQ ID NO:304, 
NO:308, SEQ ID NO:309, SEQ ID 
ID NO:314, SEQ ID NO:315, SEQ 
SEQ ID NO:320, SEQ ID NO:321, 
NO:325, SEQ ID NO:326, SEQ ID 
ID NO:331, SEQ ED NO:332, SEQ 



SEQ ID NO:203, SEQ ID 
NO:208, SEQ ID NO:209, SEQ 
ID NO:214, SEQ ID NO:2I5, 
SEQ ID NO:220, SEQ ID 
NO:225, SEQ ID NO:226, SEQ 
ID NO:231, SEQ ID NO:232, 
SEQ ID NO:237, SEQ ID 
NO:242, SEQ ID NO:243, SEQ 
ID NO:248, SEQ ID NO:249, 
SEQ ID NO:254, SEQ ID 
NO:259, SEQ ID NO:260, SEQ 
ID NO:265, SEQ ID NO:266, 
SEQIDNO:271,SEQID 
NO:276, SEQ ID NO:277, SEQ 
ID NO:282, SEQ ID NO:283, 
SEQ ID NO:288, SEQ ID 
NO:293, SEQ ID NO:294, SEQ 
ID NO:299, SEQ ID NO:300, 
SEQ ID NO:305, SEQ ID 
NO:310, SEQ ID NO:311, SEQ 
ID NO:3 1 6, SEQ ID NO:3 1 7, 
SEQ ID NO:322, SEQ ID 
NO:327, SEQ ID NO:328, SEQ 
IDNO:2697, SEQ ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 
ID 



SEQ ID NO:2707, SEQ ID NO:2679, SEQ ED NO:2717, SEQ ID NO:2646, SEQ 
SEQ ID NO:2706, SEQ ID NO:2740, SEQ ID NO:2669, SEQ ID NO:2674, SEQ 
SEQ ID NO:2716, SEQ ID NO:2727, SEQ ID NO:2721, SEQ ID NO:2641, SEQ 
SEQ ID NO:2752, SEQ ID NO:2737, SEQ ED NO:2719, SEQ ID NO:2684, SEQ 
SEQ ID NO:2748, SEQ ID NO:2703, SEQ ID NO:271 1, SEQ ID NO:2663, SEQ 
SEQ ID NO:2683, SEQ ID NO:2686, SEQ ID NO:2687, SEQ ID NO:2644, SEQ 
SEQ ID NO:2747, SEQ ID NO:2744, SEQ ID NO:2678, SEQ ID NO:2731, SEQ 
SEQ ID NO:2736, SEQ ID NO:2708, SEQ ID NO:2670, SEQ ID NO:2661, SEQ 
SEQ ID NO:2754, SEQ ID NO:2728, SEQ ID NO:2742, SEQ ID NO:2668, SEQ 
SEQ ID NO:2746, SEQ ID NO:2738, SEQ ID NO:2627, SEQ ID NO:2739, SEQ 
SEQ ID NO:2628, SEQ ID NO:2638, SEQ ID NO:2725, SEQ ID NO:2714, SEQ 
SEQ ID NO:2751, SEQ ID NO:2629, SEQ ID NO:2695, SEQ ID NO:2741, SEQ 
SEQ ID NO:2726, SEQ ID NO:2722, SEQ ID NO:2689, SEQ ID NO.2734, SEQ 
SEQ ID NO:2656, SEQ ID NO:2696, SEQ ID NO:2676, SEQ ID NO:2701, SEQ 
SEQ ID NO.2710, SEQ ID NO:2632, SEQ ID NO:2724, SEQ ID NO:2698, SEQ 
SEQ ID NO:2753, SEQ ID NO:2704, SEQ ID NO:2675, SEQ ID NO:2700, SEQ 
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NO.2640, SEQ ID NO:2723, SEQ ID NO:2658, SEQ ID NO:2688, SEQ ID NO:2735, SEQ ID 
NO:2702, SEQ ID NO:268I, SEQ ID NO:2755, SEQ ID NO:2715, SEQ ID NO:2732, SEQ ID 
NO:2652, SEQ ID NO:2651, SEQ ID NO:2718, SEQ ID NO:2673, SEQ ID NO:2733, SEQ ID 
NO:2712, SEQ ID NO:2659, SEQ ID NO:2654, SEQ ID NO:2636, SEQ ID NO:2639, SEQ ID 
NO:2690, SEQ ID NO:2705, SEQ ID NO:2685, SEQ ID NO:2692, SEQ ID NO:2693, SEQ ID 
NO:2648, SEQ ID NO:2650, SEQ ID NO:2720, SEQ ID NO:2660, SEQ ID NO:2666, SEQ ID 
NO:2699, SEQ ID NO:2633, SEQ ID NO:2672, SEQ ID NO:2642, SEQ ID NO:2682, SEQ ID 
NO:2655, SEQ ID NO:2630, SEQ ID NO:2745, SEQ ID NO:2643, SEQ ID NO:2694, SEQ ID 
NO:2749, SEQ ID NO:2665, SEQ ID NO:2649, SEQ ID NO:2637, SEQ ID NO:2634, SEQ ID 
NO:2709, SEQ ID NO:2653, SEQ ID NO:2729. The expression level may be detected by measuring 
the RNA level expressed by the one or more genes. In one variation, the RNA level is detected by 
PCR. In another variation, the RNA level is detected by hybridization. The expression level may also 
be detected by measuring one or more proteins expressed by the one or more genes. 

The present invention is further directed to methods of diagnosing or monitoring transplant 
rejection in an individual by detecting a rate of hematopoiesis. The detection may be applied directly 
to the individual, or to a sample isolated from the individual. Detection may be accomplished by RNA 
profiling assay, immunoassay, fluorescent activated cell sorting, protein assay, peripheral blood 
cytology assay, MRI imaging, bone marrow aspiration, and/or nuclear imaging. In one variation, the 
RNA profile assay is a PCR based assay. In another variation, the RNA profile assay is a hybridization 
based assay. The RNA profile assay may further include detecting the expression level of one or more 
genes in the individual where the one or more genes include a nucleotide sequence selected from SEQ 
ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, 
SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ 
ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID 
NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ED NO:25, SEQ ID NO:26, SEQ ID 
NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ IDNO:31, SEQ ID NO:32, SEQ ID 
NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID 
NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID 
NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID 
NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID 
NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID 
NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID 
NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID 
NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID 
NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID 
NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID 
NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID 
NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO: 102, SEQ ID NO:103, SEQ ID NO:104, SEQ 
ID NO: 1 05, SEQ ID NO: 1 06, SEQ ID NO: 1 07, SEQ ID NO: 1 08, SEQ ID NO: 1 09, SEQ ID NO: 1 1 0, 
SEQ ID NO: 1 1 1 , SEQ ID NO: 1 1 2, SEQ ID NO: 1 1 3, SEQ ID NO: 1 14, SEQ ID NO: 1 1 5, SEQ ID 
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NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, SEQ ID NO: 1 1 9, SEQ ID NO: 1 20, SEQ ID NO: 1 2 1 , SEQ 
ID NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ ID NO: i 27, 
SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID 
NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 135, SEQ ID NO: 1 36, SEQ ID NO: 1 37, SEQ ID NO: 1 38, SEQ 
ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 14 1, SEQ ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, 
SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID NO: 1 47, SEQ ID NO: 1 48, SEQ ID NO: 1 49, SEQ ID 
NO: 1 50, SEQ ID NO: 1 5 1 , SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ 
ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, 
SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165, SEQ ID NO: 166, SEQ ID 
NO: 1 67, SEQ ID NO: 168, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ 
ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, 
SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID 
NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ 
ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 192, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, 
SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID 
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ 
ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, 
SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ 
ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, 
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID 
NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ 
ID NO:241, SEQ ID NO:242, SEQ ID NO:243, SEQ ED NO:244, SEQ ID NO:245, SEQ ID NO:246, 
SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, SEQ ID NO:250, SEQ ID NO:251, SEQ ID 
NO:252, SEQ ID NO:253, SEQ ID NO:254, SEQ ID NO:255, SEQ ID NO:256, SEQ ID NO:257, SEQ 
ID NO:258, SEQ ID NO:259, SEQ ID NO:260, SEQ ID NO:261, SEQ ID NO:262, SEQ ID NO:263, 
SEQ ID NO:264, SEQ ID NO:265, SEQ ID NO:266, SEQ ID NO:267, SEQ ID NO:268, SEQ ID 
NO:269, SEQ ID NO:270, SEQ ID NO:271, SEQ ID NO:272, SEQ ID NO:273, SEQ ID NO:274, SEQ 
ID NO:275, SEQ ID NO:276, SEQ ID NO:277, SEQ ID NO:278, SEQ ID NO:279, SEQ ID NO:280, 
SEQ ID NO:281, SEQ ID NO:282, SEQ ID NO:283, SEQ ID NO:284, SEQ ID NO:285, SEQ ID 
NO:286, SEQ ID NO:287, SEQ ID NO:288, SEQ ID NO:289, SEQ ID NO:290, SEQ ID NO:291, SEQ 
ID NO:292, SEQ ID NO:293, SEQ ID NO:294, SEQ ID NO:295, SEQ ID NO:296, SEQ ID NO:297, 
SEQ ID NO:298, SEQ ID NO:299, SEQ ID NO:300, SEQ ID NO:301, SEQ ID NO:302, SEQ ID 
NO:303, SEQ ID NO:304, SEQ ID NO:305, SEQ ID NO:306, SEQ ID NO:307, SEQ ID NO:308, SEQ 
ID NO:309, SEQ ID NO:310, SEQ ID NO:31 1, SEQ ID NO:312, SEQ ID NO:313, SEQ ID NO:314, 
SEQ ID NO:315, SEQ ID NO:316, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:319, SEQ ID 
NO:320, SEQ ID NO:321, SEQ ID NO:322, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:325, SEQ 
ID NO:326, SEQ ID NO:327, SEQ ID NO:328, SEQ ID NO:329, SEQ IDNO:330, SEQ ID NO:331, 
SEQ ID NO:332, SEQ ID NO:2697, SEQ ID NO:2645, SEQ ID NO:2707, SEQ ID NO:2679, SEQ ID 
NO:2717, SEQ ID NO:2646, SEQ ID NO:2667, SEQ ID NO:2706, SEQ ID NO:2740, SEQ ID 
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NO:2669, SEQ ID NO:2674, SEQ ID NO:2743, SEQ ID NO:2716, SEQ ID NO:2727, SEQ ID 
NO:2721, SEQ ID NO:2641, SEQ ID NO:2671, SEQ ID NO:2752, SEQ ID NO:2737, SEQ ID 
NO:2719, SEQ ID NO:2684, SEQ ID NO:2677, SEQ ID NO:2748, SEQ ID NO:2703, SEQ ID 
NO.271 1, SEQ ID NO:2663, SEQ ID NO.2657, SEQ ID NO:2683, SEQ ID NO:2686, SEQ ID 
NO:2687, SEQ ID NO:2644, SEQ ID NO:2664, SEQ ID NO:2747, SEQ ID NO:2744, SEQ ID 
NO:2678, SEQ ID NO:2731, SEQ ID NO:2713, SEQ ID NO:2736, SEQ ID NO.-2708, SEQ ID 
NO.2670, SEQ ID NO:2661, SEQ ID NO:2680, SEQ ID NO:2754, SEQ ID NO:2728, SEQ ID 
NO:2742, SEQ ID NO:2668, SEQ ID NO.2750, SEQ ID NO.2746, SEQ ID NO.2738, SEQ ID 
NO.2627, SEQ ID NO:2739, SEQ ID NO:2647, SEQ ID NO:2628, SEQ ID NO:2638, SEQ ID 
NO:2725, SEQ ID NO:2714, SEQ ID NO:2635, SEQ ID NO.-2751, SEQ ID NO.2629, SEQ ID 
NO:2695, SEQ ID NO:2741, SEQ ID NO:2691, SEQ ID NO:2726, SEQ ID NO:2722, SEQ ID 
NO:2689, SEQ ID NO:2734, SEQ ID NO:2631, SEQ ID NO:2656, SEQ ID NO:2696, SEQ ID 
NO-.2676, SEQ ID NO:2701, SEQ ID NO.2730, SEQ ID NO:2710, SEQ ID NO:2632, SEQ ID 
NO:2724, SEQ ID NO:2698, SEQ ID NO:2662, SEQ ID NO:2753, SEQ ID NO:2704, SEQ ID 
NO:2675, SEQ ID NO:2700, SEQ ID NO:2640, SEQ ID NO:2723, SEQ ED NO:2658, SEQ ID 
NO:2688, SEQ ID NO:2735, SEQ ID NO:2702, SEQ ID NO:2681, SEQ ID NO:2755, SEQ ID 
NO:2715, SEQ ID NO:2732, SEQ ID NO:2652, SEQ ID NO:2651, SEQ ID NO:2718, SEQ ID 
NO.2673, SEQ ID NO:2733, SEQ ID NO:2712, SEQ ID NO:2659, SEQ ID NO:2654, SEQ ID 
NO:2636, SEQ ID NO:2639, SEQ ID NO:2690, SEQ ID NO:2705, SEQ ID NO:2685, SEQ ID 
NO:2692, SEQ ID NO:2693, SEQ ID NO:2648, SEQ ED NO:2650, SEQ ED NO:2720, SEQ ID 
NO:2660, SEQ ID NO:2666, SEQ ID NO:2699, SEQ ID NO:2633, SEQ ED NO.2672, SEQ ID 
NO:2642, SEQ ID NO.2682, SEQ ID NO.2655, SEQ ID NO:2630, SEQ ID NO:2745, SEQ ID 
NO:2643, SEQ ID NO:2694, SEQ ID NO:2749, SEQ ED NO:2665, SEQ ID NO:2649, SEQ ID 
NO:2637, SEQ ID NO:2634, SEQ ED NO.2709, SEQ ED NO:2653, SEQ ED NO:2729. Transplant 
rejection may include one or more of heart transplant rejection, kidney transplant rejection, liver 
transplant rejection, pancreas transplant rejection, pancreatic islet transplant rejection, lung transplant 
rejection, bone marrow transplant rejection, stem cell transplant rejection, xenotransplant rejection, and 
mechanical organ replacement rejection. 

In another aspect, the invention is directed to a method of diagnosing or monitoring transplant 
rejection in a patient by detecting the expression level of one or more genes in the patient to diagnose 
or monitor transplant rejection in the patient, wherein the one or more genes include a nucleotide 
sequence selected from SEQ ED NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, 
SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, SEQ ID NO:12, SEQ ID NO: 13, SEQ 
IDNO:14, SEQ ID NO:15, SEQ ED NO:16, SEQ ED NO:17, SEQ ID NO.18, SEQ ID NO.19, SEQ ID 
NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ED 
NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ED NO:29, SEQ ID NO:30, SEQ ID NO:3I, SEQ ID 
NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID 
NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID N0.43, SEQ ID 
NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID 
NO.50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID 
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NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID 
NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID 
NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID 
NO:74, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID 
NO:87, SEQ ID NO.88, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID 
NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:98, SEQ ID NO:101, SEQ ID NO: 102, SEQ ID 
NO: 1 03, SEQ ID NO: 1 04, SEQ ID NO: 1 05, SEQ ID NO: 1 06, SEQ ID NO: 1 07, SEQ ID NO: 1 08, SEQ 
ID NO: 1 09, SEQ ID NO: 1 1 4, SEQ ID NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, 
SEQ ID NO: 1 1 9, SEQ ID NO: 1 20, SEQ ID NO: 1 2 1 , SEQ ID NO: 1 22, SEQ ID NO: 1 23 , SEQ ID . 
NO: 124, SEQ ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ 
ID NO: 1 30, SEQ ID NO: 1 3 1 , SEQ ID NO: 1 32, SEQ ID NO: 1 33, SEQ ID NO: 1 34, SEQ ID NO: 1 35, 
SEQ ID NO: 1 36, SEQ ID NO: 137, SEQ ID NO: 1 38, SEQ ID NO: 1 39, SEQ ID NO: 1 52, SEQ ID 
NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 15 8, SEQ 
ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, 
SEQ ID NO: 165, SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID 
NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ 
ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, 
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID 
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:192, SEQ 
ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, 
SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID 
NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ 
ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, 
SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID 
NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ 
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, 
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, SEQ ID NO:242, SEQ ID NO:243, SEQ 
ID NO:244, SEQ ID NO:245, SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, 
SEQ ID NO:250, SEQ ID NO:251, SEQ ID NO:252, SEQ ID NO:253, SEQ ID NO:254, SEQ ID 
NO:255, SEQ ID NO:256, SEQ ID NO:257, SEQ ID NO:258, SEQ ID NO:259, SEQ ID NO:260, SEQ 
ID NO:261, SEQ ID NO:262, SEQ ID NO:263, SEQ ID NO:264, SEQ ID NO:265, SEQ ID NO:266, 
SEQ ID NO:267, SEQ ID NO:268, SEQ ID NO:269, SEQ ID NO:270, SEQ ID NO:271, SEQ ID 
NO:272, SEQ ID NO:273, SEQ ID NO:274, SEQ ID NO:275, SEQ ID NO:276, SEQ ID NO:277, SEQ 
ID NO:278, SEQ ID NO:279, SEQ ID NO:280, SEQ ID NO:281, SEQ ID NO:282, SEQ ID NO:283, 
SEQ ID NO:284, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:287, SEQ ID NO:288, SEQ ID 
NO:289, SEQ ID NO:290, SEQ ID NO:291, SEQ ID NO:292, SEQ ID NO:293, SEQ ID NO:294, SEQ 
ID NO:295, SEQ ID NO:296, SEQ ID NO:297, SEQ ID NO:298, SEQ ID NO:299, SEQ ID NO:300, 
SEQ ID NO:301, SEQ ID NO:302, SEQ ID NO.303, SEQ ID NO:304, SEQ ID NO:305, SEQ ID 
NO.306, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:309, SEQ ID NO:310, SEQ ID NO:31 1, SEQ 
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ID NO:3 12, SEQ ID NO:3 1 3, SEQ ID NO:3 14, SEQ ID NO:3 1 5, SEQ ID N0:3 1 6, SEQ IE) NO:3 1 7, 
SEQ ID NO:3l8, SEQ ID NO:319, SEQ ID NO:320, SEQ ID NO:321, SEQ ID NO:322, SEQ ID 
NO:323, SEQ ID NO:324, SEQ ID NO:325, SEQ ID NO:326, SEQ ID NO:327, SEQ ID NO:328, SEQ 
ID NO:329, SEQ ID NO:330, SEQ ID NO:331, SEQ ID NO:332, SEQ ID NO:2697, SEQ ID 
NO:2645, SEQ ID NO:2707, SEQ ID NO:2679, SEQ ID NO:2717, SEQ ID NO:2646, SEQ ID 
NO:2667, SEQ ID NO:2706, SEQ ID NO:2740, SEQ ID NO:2669, SEQ ID NO:2674, SEQ ID 
NO:2743, SEQ ID NO:2716, SEQ ID NO:2727, SEQ ED NO:2721, SEQ ID NO:2641, SEQ ID 
NO:2671, SEQ ID NO:2752, SEQ ID NO:2737, SEQ ID NO:2719, SEQ ID NO:2684, SEQ ID 
NO:2677, SEQ ID NO:2748, SEQ ID NO:2703, SEQ ID NO:271 1, SEQ ID NO:2663, SEQ ID 
NO:2657, SEQ ID NO:2683, SEQ ID NO:2686, SEQ ID NO:2687, SEQ ID NO:2644, SEQ ID 
NO:2664, SEQ ID NO:2747, SEQ ID NO:2744, SEQ ID NO:2678, SEQ ID NO:2731, SEQ ID 
NO:2713, SEQ ID NO:2736, SEQ ID NO:2708, SEQ ID NO:2670, SEQ ID NO:2661, SEQ ID 
NO:2680, SEQ ID NO:2754, SEQ ID NO:2728, SEQ ID NO:2742, SEQ ID NO:2668, SEQ ID 
NO:2750, SEQ ID NO:2746, SEQ ID NO:2738, SEQ ID NO:2627, SEQ ID NO:2739, SEQ ID 
NO:2647, SEQ ID NO:2628, SEQ ID NO:2638, SEQ ID NO:2725, SEQ ID NO:2714, SEQ ID 
NO:2635, SEQ ID NO:2751, SEQ ID NO:2629, SEQ ID NO:2695, SEQ IDNO:2741, SEQ ID 
NO:2691, SEQ ID NO:2726, SEQ ID NO:2722, SEQ ID NO:2689, SEQ ID NO:2734, SEQ ID 
NO:2631, SEQ ID NO:2656, SEQ ID NO:2696, SEQ ID NO:2676, SEQ ID NO:2701, SEQ ID 
NO:2730, SEQ ID NO:2710, SEQ ID NO:2632, SEQ ID NO:2724, SEQ ID NO:2698, SEQ ID 
NO:2662, SEQ ID NO:2753, SEQ ID NO:2704, SEQ ID NO:2675, SEQ ID NO:2700, SEQ ID 
NO:2640, SEQ ID NO:2723, SEQ ID NO:2658, SEQ ID NO:2688, SEQ ID NO:2735, SEQ ID 
NO:2702, SEQ ID NO:2681, SEQ ID NO:2755, SEQ ID NO:2715, SEQ ID NO:2732, SEQ ID 
NO:2652, SEQ ID NO:2651, SEQ ID NO:2718, SEQ ID NO:2673, SEQ ID NO:2733, SEQ ID 
NO:2712, SEQ ID NO:2659, SEQ ID NO:2654, SEQ ID NO:2636, SEQ ID NO:2639, SEQ ID 
NO.2690, SEQ ID NO:2705, SEQ ID NO:2685, SEQ ID NO:2692, SEQ ID NO:2693, SEQ ID 
NO:2648, SEQ ID NO:2650, SEQ ID NO:2720, SEQ ID NO:2660, SEQ ID NO:2666, SEQ ID 
NO:2699, SEQ ID NO:2633, SEQ ED NO:2672, SEQ ID NO:2642, SEQ ID NO:2682, SEQ ID 
NO:2655, SEQ ID NO:2630, SEQ ID NO:2745, SEQ ID NO:2643, SEQ ID NO:2694, SEQ ID 
NO:2749, SEQ ID NO:2665, SEQ ID NO:2649, SEQ ID NO:2637, SEQ ID NO:2634, SEQ ID 
NO:2709, SEQ ID NO:2653, SEQ ID NO:2729. In one variation, the invention is further directed to 
detecting the expression level of one or more additional genes in the patient to diagnose or monitor 
transplant rejection in the patient, wherein the one or more additional genes include a nucleotide 
sequence selected from SEQ ID NO:8, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID 
NO:78, SEQ ID NO:79, SEQ ED NO:80, SEQ ID NO:81, SEQ ID NO:89, SEQ ID NO:97, SEQ ID 
NO:99, SEQ ID NO: 100, SEQ ID NO:l 10, SEQ ID NO: 1 1 1, SEQ ID NO:l 12, SEQ ID NO:l 13, SEQ 
ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, 
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID 
NO: 15. 

In a further variation, the invention is directed to a method of diagnosing or monitoring 
cardiac transplant rejection in a patient by detecting the expression level of one or more genes in the 
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patient to diagnose or monitor cardiac transplant rejection in the patient wherein the one or more genes 
include a nucleotide sequence selected from SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID 
NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1 , SEQ ID NO: 1 2, 
SEQ ID NO: 1 3, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 1 6, SEQ ID NO: 1 7, SEQ ID NO: 1 8, 
SEQ ID NO: 1 9, SEQ ID NO:20, SEQ ID NO:2 1 , SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, 
SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, 
SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, 
SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, 
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, 
SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, 
SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, 
SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, 
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, 
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, 
SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, 
SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, 
SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO: 100, SEQ ID NO: 101, 
SEQ ID NO: 102, SEQ ID NO: 103, SEQ ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID 
NO: 1 07, SEQ ID NO: 1 08, SEQ ID NO: 1 09, SEQ ID NO: 1 1 0, SEQ ID NO: 111, SEQ ID NO: 1 1 2, SEQ 
ID NO: 1 1 3, SEQ ID NO: 1 14, SEQ ID NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, 
SEQ ID NO: 1 1 9, SEQ ID NO: 1 20, SEQ ID NO: 1 2 1 , SEQ ID NO: 1 22, SEQ ID NO: 1 23, SEQ ID 
NO: 124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ 
ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, 
SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 152, SEQ ID 
NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ 
ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, 
SEQ ID NO: 165, SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169, SEQ ID 
NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ 
ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 181, 
SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID 
NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 92, SEQ 
ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, 
SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID 
NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ 
ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, 
SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID 
NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ 
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, 
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, SEQ ID NO:242, SEQ ID NO:243, SEQ 



11 



WO 2004/042346 



PCT/US2003/012946 



ID NO:244, SEQ ID NO:245, SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, 
SEQ ID NO:250, SEQ ID NO:25 1, SEQ ID NO:252, SEQ ID NO:253, SEQ ID NO:254, SEQ ID 
NO:255, SEQ ID NO:256, SEQ ID NO:257, SEQ ID NO:258, SEQ ID NO:259, SEQ ID NO:260, SEQ 
ID NO:261, SEQ ID NO:262, SEQ ID NO:263, SEQ ID NO:264, SEQ ID NO:265, SEQ ID NO:266, 
SEQ ID NO:267, SEQ ID NO:268, SEQ ID NO:269, SEQ ID NO:270, SEQ ID NO:271, SEQ ID 
NO:272, SEQ ID NO:273, SEQ ID NO:274, SEQ ID NO:275, SEQ ID NO:276, SEQ ID NO:277, SEQ 
ID NO:278, SEQ ID NO:279, SEQ ID NO:280, SEQ ID NO:281 , SEQ ID NO:282, SEQ ID NO:283, 
SEQ ID NO:284, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:287, SEQ ID NO:288, SEQ ID 
NO:289, SEQ ID NO:290, SEQ ID NO:291, SEQ ID NO:292, SEQ ID NO:293, SEQ ID NO:294, SEQ 
ID NO:295, SEQ ID NO:296, SEQ ID NO:297, SEQ ID NO:298, SEQ ID NO:299, SEQ ID NO:300, 
SEQ ID NO:301, SEQ ID NO:302, SEQ ID NO:303, SEQ ID NO:304, SEQ ID NO:305, SEQ ID 
NO:306, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:309, SEQ ID NO:310, SEQ ID NO:31 1, SEQ 
ID NO:312, SEQ ID NO:313, SEQ ID NO:314, SEQ ID NO:315, SEQ IDNO:316, SEQ ID NO:317, 
SEQ ID NO:318, SEQ ID NO:319, SEQ ID NO:320, SEQ ID NO:321, SEQ ID NO:322, SEQ ID 
NO:323, SEQ ID NO:324, SEQ ID NO:325, SEQ ID NO:326, SEQ ID NO:327, SEQ ID NO:328, SEQ 
ID NO:329, SEQ ID NO:330, SEQ ID NO:331, SEQ ID NO:332. In one variation, the method 
includes detecting the expression level of one or more additional genes in the patient to diagnose or 
monitor cardiac transplant rejection in the patient, wherein the one or more additional genes include a 
nucleotide sequence selected from SEQ ID NO:8, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, 
SEQ ID NO:79, SEQ ID NO:97, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID 
NO:I43, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ IDNO:148, SEQ 
ID NO: 149, SEQ ID NO: 1 50, SEQ ID NO: 151. 

The invention is also directed to a method of diagnosing or monitoring kidney transplant 
rejection in a patient by detecting the expression level of one or more genes in the patient to diagnose 
or monitor kidney transplant rejection in the patient wherein the one or more genes include a nucleotide 
sequence selected from SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, 
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, SEQ 
ID NO: 1 3, SEQ ID NO: 1 4, SEQ ID NO: 1 5, SEQ ID NO: 1 6, SEQ ID NO: 1 7, SEQ ID NO: 1 8, SEQ ID 
NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID 
NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID 
NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID 
NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID 
NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID 
NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID 
NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID 
NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID 
NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID 
NO:73, SEQ ID NO:74, SEQ ID NO:78, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID 
NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:90, SEQ ID NO:91, SEQ ID 
NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID 
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NO:98, SEQ ID NOrlOl, SEQ ID NO:102, SEQ ID NO.103, SEQ ID NO:104, SEQ ID NO:105, SEQ 
ID NO: 1 06, SEQ ID NO: 1 07, SEQ ID NO: 108, SEQ ID NO: 1 09, SEQ ID NO: 1 14, SEQ ID NO: 1 1 5, 
SEQ ID NO.l 16, SEQ ID NO:l 17, SEQ ID NO:l 18, SEQ ID NO:l 19, SEQ ID NO: 120, SEQ ID 
NO: 1 2 1 , SEQ ID NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ ID NO: 1 25, SEQ ID NO: 1 26, SEQ 
ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ IDNO:131, SEQ ID NO: 132, 
SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID 
NO: 138, SEQ ID NO: 139, SEQ ID NO: 152, SEQ ID NO. 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ 
ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, SEQ ID NO: 161, 
SEQ ID NO:162, SEQ ID NO:163, SEQ IDNO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID 
NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID NO: 1 70, SEQ ID NO: 1 7 1 , SEQ ID NO: 1 72, SEQ 
ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, 
SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 18 1 , SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID 
NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ 
ID NO:190, SEQ ID NO:191, SEQ ID NO:192, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, 
SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID 
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ 
ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, 
SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ 
ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, 
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID 
NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ 
ID NO:241, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:246, 
SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, SEQ ID NO:250, SEQ ID NO:251, SEQ ID 
NO:252, SEQ ID NO:253, SEQ ID NO:254, SEQ ID NO:255, SEQ ID NO:256, SEQ ID NO:257, SEQ 
ID NO:258, SEQ ID NO:259, SEQ ID NO:260, SEQ ID NO:261, SEQ ID NO:262, SEQ ID NO:263, 
SEQ ID NO:264, SEQ ID NO:265, SEQ ID NO:266, SEQ ID NO:267, SEQ ID NO:268, SEQ ID 
NO:269, SEQ ID NO:270, SEQ ID NO:271, SEQ ID NO:272, SEQ ID NO:273, SEQ ID NO:274, SEQ 
ID NO:275, SEQ ID NO:276, SEQ ID NO:277, SEQ ID NO:278, SEQ ID NO:279, SEQ ID NO.280, 
SEQ ID NO:281, SEQ ID NO:282, SEQ ID NO:283, SEQ ID NO:284, SEQ ID NO:285, SEQ ID 
NO:286, SEQ ID NO:287, SEQ ID NO:288, SEQ ID NO:289, SEQ ID NO:290, SEQ ID NO:291, SEQ 
ID NO:292, SEQ ID NO:293, SEQ ID NO:294, SEQ ID NO:295, SEQ ID NO:296, SEQ ID NO:297, 
SEQ ID NO:298, SEQ ID NO:299, SEQ ID NO:300, SEQ ID NO:301, SEQ ID NO:302, SEQ ID 
NO:303, SEQ ID NO:304, SEQ ID NO:305, SEQ ID NO:306, SEQ ID NO:307, SEQ ID NO:308, SEQ 
ID NO:309, SEQ ID NO:310, SEQ ID NO:31 1, SEQ ID NO:312, SEQ ID NO:313, SEQ ID NO:314, 
SEQ ID NO:315, SEQ ID NO:316, SEQ IDNO:317, SEQ ID NO:318, SEQ ID NO:319, SEQ ID 
NO:320, SEQ ID NO:321, SEQ ID NO:322, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:325, SEQ 
ID NO:326, SEQ ID NO:327, SEQ ID NO:328, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:331, 
SEQ ID NO:332, SEQ ID NO:2697, SEQ ID NO:2645, SEQ ID NO:2707, SEQ ID NO:2679, SEQ ID 
NO:2717, SEQ ID NO:2646, SEQ ID NO:2667, SEQ ID NO:2706, SEQ ID NO:2740, SEQ ID 
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NO:2669, SEQ ID NO:2674, SEQ ID NO:2743, SEQ ID NO:2716, SEQ ID NO:2727> SEQ ID 
NO:2721, SEQ ID NO:2641, SEQ ID NO:2671, SEQ ID NO:2752, SEQ ID NO:2737, SEQ ID 
NO:2719, SEQ ID NO:2684, SEQ ID NO:2677, SEQ ID NO:2748, SEQ ID NO:2703, SEQ ID 
NO:271 1, SEQ ID NO:2663, SEQ ID NO:2657, SEQ ID NO:2683, SEQ ID NO:2686, SEQ ID 
NO:2687, SEQ ID NO:2644, SEQ ID NO:2664, SEQ ID NO:2747, SEQ ID NO:2744, SEQ ID 
NO:2678, SEQ ID NO:2731, SEQ ID NO:2713, SEQ ID NO:2736, SEQ ID NO:2708, SEQ ID 
NO:2670, SEQ ID NO:2661, SEQ ID NO:2680, SEQ ID NO:2754, SEQ ID NO.2728, SEQ ID 
NO:2742, SEQ ID NO:2668, SEQ ID NO:2750, SEQ ID NO:2746, SEQ ID NO:2738, SEQ ID 
NO:2627, SEQ ID NO:2739, SEQ ID NO:2647, SEQ ID NO:2628, SEQ ID NO:2638, SEQ ID 
NO:2725, SEQ ID NO:2714, SEQ ID NO:2635, SEQ ID NO:2751, SEQ ID NO:2629, SEQ ID 
NO:2695, SEQ ID NO:2741, SEQ ID NO:2691, SEQ ID NO:2726, SEQ ID NO:2722, SEQ ID 
NO:2689, SEQ ID NO:2734, SEQ ID NO:2631, SEQ ID NO:2656, SEQ ID NO:2696, SEQ ID 
NO:2676, SEQ ID NO:2701, SEQ ID NO:2730, SEQ ID NO:2710, SEQ ID NO:2632, SEQ ED 
NO:2724, SEQ ID NO:2698, SEQ ID NO:2662, SEQ ID NO:2753, SEQ ID NO:2704, SEQ ID 
NO:2675, SEQ ID NO:2700, SEQ ID NO:2640, SEQ ID NO:2723, SEQ ID NO:2658, SEQ ID 
NO:2688, SEQ ID NO:2735, SEQ ID NO:2702, SEQ ID NO:2681, SEQ ID NO:2755, SEQ ID 
NO:2715, SEQ ID NO:2732, SEQ ID NO:2652, SEQ ID NO:2651, SEQ ID NO:2718, SEQ ID 
NO:2673, SEQ ID NO:2733, SEQ ID NO:2712, SEQ ID NO:2659, SEQ ID NO:2654, SEQ ID 
NO:2636, SEQ ID NO:2639, SEQ ID NO:2690, SEQ ID NO:2705, SEQ ID NO:2685, SEQ ID 
NO:2692, SEQ ID NO:2693, SEQ ID NO:2648, SEQ ID NO:2650, SEQ ID NO:2720, SEQ ID 
NO:2660, SEQ ID NO:2666, SEQ ID NO:2699, SEQ ID NO:2633, SEQ ID NO:2672, SEQ ID 
NO:2642, SEQ ID NO:2682, SEQ ID NO:2655, SEQ ID NO:2630, SEQ ID N0 2745, SEQ ID 
NO:2643, SEQ ID NO:2694, SEQ ID NO:2749, SEQ ID NO:2665, SEQ ID NO:2649, SEQ ID 
NO:2637, SEQ ID NO:2634, SEQ ID NO:2709, SEQ ID NO:2653, SEQ ID NO:2729. In one 
variation, the method further includes detecting the expression level of one or more additional genes in 
the patient to diagnose or monitor kidney transplant rejection in a patient, wherein the one or more 
additional genes includes a nucleotide sequence selected from SEQ ID NO: 75, SEQ ID NO:76, SEQ 
ID NO:77, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:89, SEQ ID NO:99, SEQ ID 
NO: 1 00, SEQ ID NO: 1 1 0, SEQ ID NO: 111, SEQ ID NO: 1 1 2, SEQ ID NO: 1 1 3, SEQ ID NO: 140, SEQ 
ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO: 144, SEQ ID NO:145, SEQ ID NO:146, 
SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151. 

In another aspect, the methods of diagnosing or monitoring transplant rejection include 
detecting the expression level of at least two of the genes. In another variation, methods of diagnosing 
or monitoring transplant rejection include detecting the expression level of at least ten of the genes. In 
a further variation, the methods of diagnosing or monitoring transplant rejection include detecting the 
expression level of at least one hundred of the genes. In still a further variation, the methods of 
diagnosing or monitoring transplant rejection include detecting the expression level of all the listed 
genes. 

In another variation, transplant rejection may be selected from heart transplant rejection, 
kidney transplant rejection, liver transplant rejection, pancreas transplant rejection, pancreatic islet 



14 



WO 2004/042346 



PCT/US2003/0 12946 



transplant rejection, lung transplant rejection, bone marrow transplant rejection, stem cell transplant 
rejection, xe no transplant rejection, and mechanical organ replacement rejection. 

In another aspect, the methods of detecting transplant rejection include detecting the 
expression level by measuring the RNA level expressed by one or more genes. The method may 
further including isolating RNA from the patient prior to detecting the RNA level expressed by the one 
or more genes. 

In one variation, the RNA level is detected by PCR. In a still further variation, the PCR uses 
primers consisting of nucleotide sequences selected from the group consisting of SEQ ID NO:665, 
SEQ ID NO:666, SEQ ID NO:667, SEQ ID NO:668, SEQ ID NO:669, SEQ ID NO:670, SEQ ID 
NO:671, SEQ ID NO:672, SEQ ID NO:673, SEQ ID NO:674, SEQ ID NO:675, SEQ ID NO:676, SEQ 
ID NO:677, SEQ ID NO:678, SEQ ID NO:679, SEQ ID NO:680, SEQ ID NO:681, SEQ ID NO:682, 
SEQ ID NO:683, SEQ ID NO.684, SEQ ID NO:685, SEQ ID NO:686, SEQ ID NO:687, SEQ ID 
NO:688, SEQ ID NO:689, SEQ ID NO:690, SEQ ID NO:691, SEQ ID NO:692, SEQ ID NO:693, SEQ 
ID NO:694, SEQ ID NO:695, SEQ ID NO:696, SEQ ID NO:697, SEQ ID NO:698, SEQ ID NO:699, 
SEQ ID NO:700, SEQ ID NO:701, SEQ ID NO:702, SEQ ID NO:703, SEQ ID NO:704, SEQ ID 
NO:705, SEQ ID NO:706, SEQ ID NO:707, SEQ ID NO:708, SEQ ID NO:709, SEQ ID NO:710, SEQ 
ID NO:71 1, SEQ ID NO:712, SEQ ID NO:713, SEQ ID NO:714, SEQ ID NO:715, SEQ ID NO:716, 
SEQ ID NO:717, SEQ ID NO:718, SEQ ID NO:719, SEQ ID NO:720, SEQ ID NO:721, SEQ ID 
NO:722, SEQ ID NO:723, SEQ ID NO:724, SEQ ID NO:725, SEQ ED NO:726, SEQ ID NO:727, SEQ 
ID NO:728, SEQ ID NO:729, SEQ ID NO:730, SEQ ID NO:731, SEQ ID NO:732, SEQ ID NO:733, 
SEQ ID NO:734, SEQ ID NO:735, SEQ ID NO:736, SEQ ID NO:737, SEQ ID NO:738, SEQ ID 
NO:739, SEQ ID NO:740, SEQ ID NO:741, SEQ ID NO:742, SEQ ID NO:743, SEQ ID NO:744, SEQ 
ID NO:745, SEQ ID NO:746, SEQ ID NO:747, SEQ ID NO:748, SEQ ID NO:749, SEQ ID NO:750, 
SEQ ID NO:751, SEQ ID NO:752, SEQ ID NO:753, SEQ ID NO:754, SEQ ID NO:755, SEQ ID 
NO:756, SEQ ID NO:757, SEQ ID NO:758, SEQ ID NO:759, SEQ ID NO:760, SEQ ID NO:761, SEQ 
ID NO:762, SEQ ID NO:763, SEQ ID NO:764, SEQ ID NO:765, SEQ ID NO:766, SEQ ID NO:767, 
SEQ ID NO:768, SEQ ID NO:769, SEQ ID NO:770, SEQ ID NO:771, SEQ ID NO:772, SEQ ID 
NO:773, SEQ ID NO:774, SEQ ID NO:775, SEQ ID NO:776, SEQ ID NO:777, SEQ ID NO:778, SEQ 
ID NO:779, SEQ ID NO:780, SEQ ID NO:781, SEQ ID NO:782, SEQ ID NO:783, SEQ ID NO:784, 
SEQ ID NO:785, SEQ ID NO:786, SEQ ID NO:787, SEQ ID NO:788, SEQ ID NO:789, SEQ ID 
NO:790, SEQ ID NO:791, SEQ ID NO:792, SEQ ID NO:793, SEQ ID NO:794, SEQ ID NO:795, SEQ 
ID NO:796, SEQ ID NO:797, SEQ ID NO:798, SEQ ID NO:799, SEQ ID NO:800, SEQ ID NO:801, 
SEQ ID NO:802, SEQ ID NO:803, SEQ ID NO:804, SEQ ID NO:805, SEQ ID NO:806, SEQ ID 
NO:807, SEQ ID NO:808, SEQ ID NO:809, SEQ ID NO:810, SEQ ID NO:81 1, SEQ ID NO:812, SEQ 
ID NO:813, SEQ ID NO:814, SEQ ID NO:815, SEQ ID NO:816, SEQ ID NO:817, SEQ ID NO:818, 
SEQ ID NO:819, SEQ ID NO:820, SEQ ID NO:821, SEQ ID NO:822, SEQ ID NO:823, SEQ ID 
NO:824, SEQ ID NO:825, SEQ ID NO:826, SEQ ID NO:827, SEQ ID NO:828, SEQ ID NO:829, SEQ 
ID NO:830, SEQ ID NO:831, SEQ ID NO:832, SEQ ID NO:833, SEQ ID NO:834, SEQ ED NO:835, 
SEQ ID NO:836, SEQ ID NO:837, SEQ ID NO:838, SEQ ID NO:839, SEQ ID NO:840, SEQ ID 
NO:841, SEQ ID NO:842, SEQ ID NO:843, SEQ ID NO:844, SEQ ID NO:845, SEQ ID NO:846, SEQ 
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ID NO:847, SEQ ID NO:848, SEQ ID NO:849, SEQ ID NO:850, SEQ ID NO:851, SEQ ID NO:852, 
SEQ ID NO:853, SEQ ID NO:854, SEQ ID NO:855, SEQ ID NO:856, SEQ ID NO:857, SEQ ID 
NO:858, SEQ ID NO:859, SEQ ID NO:860, SEQ ID NO:86l, SEQ ID NO:862, SEQ ID NO:863, SEQ 
ID NO:864, SEQ ID NO:865, SEQ ID NO:866, SEQ ID NO:867, SEQ ID NO:868, SEQ ID NO:869, 
SEQ ID NO:870, SEQ ID NO:871, SEQ ID NO:872, SEQ ID NO: 873, SEQ ID NO:874, SEQ ID 
NO:875, SEQ ID NO:876, SEQ ID NO:877, SEQ ID NO:878, SEQ ID NO:879, SEQ ID NO:880, SEQ 
ID NO:881, SEQ ID NO:882, SEQ ID NO:883, SEQ ID NO:884, SEQ ID NO:885, SEQ ID NO:886, 
SEQ ID NO:887, SEQ ID NO:888, SEQ ID NO:889, SEQ ID NO:890, SEQ ID NO:891, SEQ ID 
NO:892, SEQ ID NO:893, SEQ ID NO:894, SEQ ID NO:895, SEQ ID NO:896, SEQ ID NO:897, SEQ 
ID NO:898, SEQ ID NO:899, SEQ ID NO:900, SEQ ID NO:901, SEQ ID NO:902, SEQ ID NO:903, 
SEQ ID NO:904, SEQ ID NO:905, SEQ ID NO:906, SEQ ID NO.907, SEQ ID NO:908, SEQ ID 
NO:909, SEQ ID NO:910, SEQ ID NO:91 1, SEQ ID NO:912, SEQ ID NO:913, SEQ ID NO:914, SEQ 
ID NO:915, SEQ ID NO:916, SEQ ID NO:917, SEQ ID NO:918, SEQ ID NO:919, SEQ ID NO:920, 
SEQ ID NO:921, SEQ ID NO:922, SEQ ID NO:923, SEQ ID NO:924, SEQ ID NO:925, SEQ ID 
NO:926, SEQ ID NO:927, SEQ ID NO:928, SEQ ID NO.929, SEQ ID NO:930, SEQ ID NO:931, SEQ 
ID NO:932, SEQ ID NO:933, SEQ ID NO:934, SEQ ID NO:935, SEQ ID NO:936, SEQ ID NO:937, 
SEQ ID NO:938, SEQ ID NO:939, SEQ ID NO:940, SEQ ID NO:941, SEQ ID NO:942, SEQ ID 
NO:943, SEQ ID NO:944, SEQ ID NO:945, SEQ ID NO:946, SEQ ID NO:947, SEQ ID NO:948, SEQ 
ID NO:949, SEQ ID NO:950, SEQ ID NO:951, SEQ ID NO:952, SEQ ID NO:953, SEQ ID NO:954, 
SEQ ID NO:955, SEQ ID NO:956, SEQ ID NO:957, SEQ ID NO:958, SEQ ID NO:959, SEQ ID 
NO:960, SEQ ID NO:961, SEQ ID NO:962, SEQ ID NO:963, SEQ ID NO:964, SEQ ID NO:965, SEQ 
ID NO:966, SEQ ID NO:967, SEQ ID NO:968, SEQ ID NO:969, SEQ ID NO:970, SEQ ID NO:971, 
SEQ ID NO:972, SEQ ID NO:973, SEQ ID NO:974, SEQ ID NO:975, SEQ ID NO:976, SEQ ID 
NO:977, SEQ ID NO:978, SEQ ID NO:979, SEQ ID NO:980, SEQ ID NO:981, SEQ ID NO:982, SEQ 
ID NO:983, SEQ ID NO:984, SEQ ID NO:985, SEQ ID NO:986, SEQ ID NO:987, SEQ ID NO:988, 
SEQ ID NO:989, SEQ ID NO:990, SEQ ID NO:991, SEQ ID NO:992, SEQ ID NO:993, SEQ ID 
NO:994, SEQ ID NO:995, SEQ ID NO:996, SEQ ID NO:997, SEQ ID NO:998, SEQ ID NO:999, SEQ 
ID NO: 1000, SEQ ID NO: 1001, SEQ ID NO: 1002, SEQ ID NO: 1003, SEQ ID NO: 1004, SEQ ID 
NO: 1005, SEQ ID NO: 1006, SEQ ID NO: 1007, SEQ ID NO: 1008, SEQ ID NO: 1009, SEQ ID 
NO:1010, SEQ IDNO:1011, SEQ ID NO:1012, SEQ ID NO:1013, SEQ ID NO:1014, SEQ ID 
NO: 101 5, SEQ ID NO: 1016, SEQ ID NO: 101 7, SEQ ID NO: 1018, SEQ ID NO: 101 9, SEQ ID 
NO:1020, SEQ ID NO:1021, SEQ ID NO:1022, SEQ ID NO:1023, SEQ ID NO:1024, SEQ ID 
NO:1025, SEQ ID NO:1026, SEQ ID NO:1027, SEQ ID NO:1028, SEQ ID NO:1029, SEQ ID 
NO:1030, SEQ ID NO:1031, SEQ ID NO:1032, SEQ ID NO:1033, SEQ ID NO:1034, SEQ ID 
NO:1035, SEQ ID NO:1036, SEQ ID NO:1037, SEQ ID NO:1038, SEQ ID NO:1039, SEQ ID 
NO:1040, SEQ ID NO: 1041, SEQ ID NO:1042, SEQ ID NO:1043, SEQ ID NO:1044, SEQ ID 
NO:1045, SEQ ID NO.1046, SEQ ID NO:1047, SEQ ID NO:1048, SEQ ID NO:1049, SEQ ID 
NO: 1050, SEQ ID NO: 1051, SEQ ID NO: 1052, SEQ ID NO: 1053, SEQ ID NO: 1054, SEQ ID 
NO:1055, SEQ ID NO:1056, SEQ ID NO:1057, SEQ ID NO:1058, SEQ ID NO:1059, SEQ ID 
NO:1060, SEQ ID NO:1061, SEQ ID NO:1062, SEQ ID NO:1063, SEQ ID NO:1064, SEQ ID 
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NO: 1065, SEQ ID NO: 1066, SEQ ID NO:1067, 
NO: 1070, SEQ ID NO: 1071, SEQ ID NO: 1072, 
NO: 1075, SEQ ID NO: 1076, SEQ ID NO: 1077, 
NO: 1 080, SEQ ID NO: 1081, SEQ ID NO: 1082, 
NO:1085, SEQ ID NO:1086, SEQ ID NO:1087, 
NO: 1090, SEQ ID NO: 1091, SEQ ID NO: 1092, 
NO:1095, SEQ ID NO:1096, SEQ ID NO:1097, 
NO: 1 100, SEQ ID NO: 1 101, SEQ ID NO: 1 102, 
NO: 1 105, SEQ ID NO: 1 106, SEQ ID NO: 1 107, 
NO:l 1 10, SEQ ID NO:l 1 1 1, SEQ ID NO:l 1 12, 
NO: 1115, SEQ ID NO: 1116, SEQ ID NO: 1 1 1 7, 
NO: 1 1 20, SEQ ID NO: 1121, SEQ ID NO: 1 122, 
NO:l 125, SEQ ID NO:l 126, SEQ ID NO:l 127, 
NO: 1 1 30, SEQ ID NO: 1 1 3 1 , SEQ ID NO: 1 1 32, 
NO: 1 135, SEQ ID NO: 1 136, SEQ ID NO: 1137, 
NO: 1 140, SEQ ID NO: 1 141, SEQ ID NO:1142, 
NO: 1 145, SEQ ID NO: 1 146, SEQ ID NO: 1 147, 
NO: 1150, SEQ ID NO: 1 1 5 1 , SEQ ID NO: 1 1 52, 
NO: 1 155, SEQ ID NO: 1 156, SEQ ID NO: 11 57, 
NO: 1 1 60, SEQ ID NO: 1 1 6 1 , SEQ ID NO: 1 1 62, 
NO:l 165, SEQ ID NO: 1 166, SEQ ID NO: 1 167, 
NO: 1 170, SEQ ID NO: 1 1 7 1 , SEQ ID NO: 1 1 72, 
NO: 1 175, SEQ ID NO: 1 176, SEQ ID NO: 1 177, 
NO: 1180, SEQ ID NO: 1181, SEQ ID NO: 1182, 
NO: 1 1 85, SEQ ID NO: 1 1 86, SEQ ID NO: 1 1 87, 
NO:l 190, SEQ ID NO: 1 191, SEQ ID NO: 1192, 
NO: 1 195, SEQ ID NO:l 196, SEQ ID NO: 1197, 
NO:1200, SEQ ID NO:1201, SEQ ID NO:1202, 
NO:1205, SEQ ID NO:1206, SEQ ID NO:1207, 
NO:1210, SEQ ID NO:121 1, SEQ ID NO:1212, 
NO: 1 2 1 5, SEQ ID NO: 1216, SEQ ID NO: 1 2 1 7, 
NO: 1 220, SEQ ID NO: 1 22 1 , SEQ ID NO: 1 222, 
NO: 1225, SEQ ID NO: 1226, SEQ ID NO: 1227, 
NO: 1 230, SEQ ID NO: 1231, SEQ ID NO: 1 232, 
NO:1235, SEQ ID NO:1236, SEQ ID NO:1237, 
NO: 1 240, SEQ ID NO: 1 24 1 , SEQ ID NO: 1 242, 
NO: 1245, SEQ ID NO: 1246, SEQ ID NO: 1247, 
NO: 1250, SEQ ID NO: 1251, SEQ ID NO: 1252, 
NO: 1255, SEQ ID NO: 1256, SEQ ID NO: 1257, 
NO: 1 260, SEQ ID NO: 1 26 1 , SEQ ID NO: 1 262, 



SEQ ID NO: 1068, SEQ ID NO: 1069, SEQ ID 
SEQ ID NO: 1073, SEQ ID NO: 1074, SEQ ID 
SEQ ID NO: 1078, SEQ ID NO: 1079, SEQ ID 
SEQ ID NO: 1 083, SEQ ID NO: 1 084, SEQ ID 
SEQ ID NO: 1088, SEQ ID NO: 1089, SEQ ID 
SEQ ID NO: 1093, SEQ ID NO: 1 094, SEQ ID 
SEQ ID NO: 1098, SEQ ID NO: 1099, SEQ ID 
SEQ ID NO:l 103, SEQ ID NO: 1 104, SEQ ID 
SEQ ID NO: 1 108, SEQ ID NO: 1 109, SEQ ID 
SEQ ID NO: 1113, SEQ ID NO: 1114, SEQ ID 
SEQ ID NO: 1 1 1 8, SEQ ID NO: 1119, SEQ ID 
SEQ ID NO: 1 123, SEQ ID NO: 1 124, SEQ ID 
SEQ ID NO: 1 128, SEQ ID NO: 1 129, SEQ ID 
SEQ ID NO: 1 133, SEQ ID NO: 1 134, SEQ ID 
SEQ ED NO:l 138, SEQ ID NO:l 139, SEQ ID 
SEQ ID NO: 1143, SEQ ID NO: 1 144, SEQ ID 
SEQ ID NO: 1 148, SEQ ID NO: 1 149, SEQ ID 
SEQ ID NO: 1 153, SEQ ID NO: 1 154, SEQ ID 
SEQ ID NO: 1 158, SEQ ID NO: 1 159, SEQ ID 
SEQ ID NO: 1 163, SEQ ID NO: 1 164, SEQ ID 
SEQ ID NO: 1 1 68, SEQ ID NO: 1 1 69, SEQ ID 
SEQ ID NO:l 173, SEQ ID NO:l 174, SEQ ID 
SEQ ED NO:l 178, SEQ ID NO:l 179, SEQ ID 
SEQ ID NO: 1 183, SEQ ID NO: 1 184, SEQ ID 
SEQ ID NO: 1 188, SEQ ID NO: 1 189, SEQ ID 
SEQ ID NO: 1 193, SEQ ID NO: 1 194, SEQ ID 
SEQ ID NO: 1 198, SEQ ID NO:l 199, SEQ ID 
SEQ ID NO:1203, SEQ ID NO:1204, SEQ ID 
SEQ ID NO: 1208, SEQ ID NO: 1209, SEQ ID 
SEQ ID NO: 1213, SEQ ID NO: 1214, SEQ ID 
SEQ ID NO: 1 2 1 8, SEQ ID NO: 1 2 1 9, SEQ ID 
SEQ ID NO: 1223, SEQ ID NO: 1224, SEQ ID 
SEQ ID NO: 1228, SEQ ID NO: 1229, SEQ ID 
SEQ ID NO: 1233, SEQ ID NO: 1234, SEQ ID 
SEQ ID NO: 1238, SEQ ID NO: 1239, SEQ ID 
SEQ ID NO: 1243, SEQ ID NO: 1244, SEQ ID 
SEQ ID NO: 1 248, SEQ ID NO: 1 249, SEQ ID 
SEQ ID NO: 1253, SEQ ID NO: 1254, SEQ ID 
SEQ ID NO: 125 8, SEQ ID NO: 1259, SEQ ID 
SEQ ID NO: 1263, SEQ ID NO: 1264, SEQ ID 
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NO: 1 265, SEQ ID NO: 1 266, SEQ ID NO: 1 267, SEQ ID NO: 1 268, SEQ ID NO: 1 269, SEQ ID 
NO: 1 270, SEQ ID NO: 1271, SEQ ID NO: 1 272, SEQ ID NO: 1 273, SEQ ID NO: 1 274, SEQ ID 
NO: 1 275, SEQ ID NO: 1 276, SEQ ID NO: 1 277, SEQ ID NO: 1 278, SEQ ID NO: 1 279, SEQ ID 
NO: 1 280, SEQ ID NO: 1281, SEQ ID NO: 1 282, SEQ ID NO: 1 283, SEQ ID NO: 1 284, SEQ ID 
NO: 1285, SEQ ID NO: 1286, SEQ ID NO: 1287, SEQ ID NO: 1288, SEQ ID NO: 1289, SEQ ID 
NO: 1290, SEQ ID NO: 1291, SEQ ID NO: 1292, SEQ ID NO: 1293, SEQ ID NO: 1294, SEQ ID 
NO: 1295, SEQ ID NO: 1296, SEQ ID NO: 1297, SEQ ID NO: 1298, SEQ ID NO: 1299, SEQ ID 
NO: 1 300, SEQ ID NO: 1 30 1 , SEQ ID NO: 1 302, SEQ ID NO: 1 303, SEQ ID NO: 1 304, SEQ ID 
NO: 1305, SEQ ID NO: 1306, SEQ ID NO: 1307, SEQ ID NO: 1308, SEQ ID NO: 1309, SEQ ID 
NO: 1 3 1 0, SEQ ID NO: 1 3 1 1 , SEQ ID NO: 1312, SEQ ID NO: 1 3 1 3 , SEQ ID NO: 1 3 1 4, SEQ ID 
NO: 1 3 1 5, SEQ ID NO: 1 3 1 6, SEQ ID NO: 1 3 1 7, SEQ ID NO: 1 3 1 8, SEQ ID NO: 1 3 1 9, SEQ ID 
NO:I320, SEQ ID NO:1321, SEQ ID NO:1322, SEQ ID NO:1323, SEQ ID NO:1324, SEQ ID 
NO: 1325, SEQ ID NO: 1326, SEQ ID NO: 1656, SEQ ID NO: 1657, SEQ ID NO: 1658, SEQ ID 
NO: 1 659, SEQ ID NO: 1 660, SEQ ID NO: 1 66 1 , SEQ ID NO: 1 662, SEQ ID NO: 1 663, SEQ ID 
NO: 1664, SEQ ID NO: 1665, SEQ ID NO: 1666, SEQ ID NO: 1667, SEQ ID NO: 1668, SEQ ID 
NO: 1669, SEQ ID NO: 1670, SEQ ID NO: 1671, SEQ ID NO: 1672, SEQ ID NO: 1673, SEQ ID 
NO: 1674, SEQ ID NO: 1675, SEQ ID NO: 1676, SEQ ID NO: 1677, SEQ ID NO: 1678, SEQ ID 
NO: 1679, SEQ ID NO: 1680, SEQ ID NO: 1681, SEQ ID NO: 1682, SEQ ID NO: 1683, SEQ ID 
NO: 1684, SEQ ID NO: 1685, SEQ ID NO: 1686, SEQ ID NO: 1687, SEQ ID NO: 1688, SEQ ID 
NO: 1689, SEQ ID NO: 1690, SEQ ID NO: 1691, SEQ ID NO: 1692, SEQ ID NO: 1693, SEQ ID 
NO: 1694, SEQ ID NO: 1695, SEQ ID NO: 1696, SEQ ID NO: 1697, SEQ ID NO: 1698, SEQ ID 
NO.1699, SEQ ID NO:1700, SEQ ID NO:1701, SEQ ID NO:1702, SEQ ID NO:1703, SEQ ID 
NO: 1704, SEQ ID NO: 1705, SEQ ID NO: 1706, SEQ ID NO: 1707, SEQ ID NO: 1708, SEQ ID 
NO:1709, SEQ ID NO:1710, SEQ ID NO:171 1, SEQ ID NO:1712, SEQ ID NO:1713, SEQ ID 
NO:1714, SEQ ID NO:1715, SEQ ID NO:1716, SEQ ID NO:1717, SEQ ID NO:1718, SEQ ID 
NO: 1 7 1 9, SEQ ID NO: 1 720, SEQ ID NO: 1 72 1 , SEQ ID NO: 1 722, SEQ ID NO: 1 723, SEQ ID 
NO:1724, SEQ ID NO:1725, SEQ ID NO:1726, SEQ ID NO:1727, SEQ ID NO:1728, SEQ ID 
NO: 1729, SEQ ID NO: 1730, SEQ ID NO: 1731, SEQ ID NO: 1732, SEQ ID NO: 1733, SEQ ID 
NO: 1734, SEQ ID NO: 1735, SEQ ID NO: 1736, SEQ ID NO: 1737, SEQ ID NO: 1738, SEQ ID 
NO:1739, SEQ ID NO:1740, SEQ ID NO:1741, SEQ ID NO:1742, SEQ ID NO:1743, SEQ ID 
NO: 1744, SEQ ID NO: 1745, SEQ ID NO: 1746, SEQ ID NO: 1747, SEQ ID NO: 1748, SEQ ID 
NO. 1749, SEQ ID NO:1750, SEQ ID NO:1751, SEQ ID NO:1752, SEQ ID NO:1753, SEQ ID 
NO:1754, SEQ ID NO:1755, SEQ ID NO:1756, SEQ ID NO:1757, SEQ ID NO:1758, SEQ ID 
NO: 1 759, SEQ ID NO: 1 760, SEQ ID NO: 1 76 1 , SEQ ID NO: 1 762, SEQ ID NO: 1 763, SEQ ID 
NO: 1 764, SEQ ID NO: 1 765, SEQ ID NO: 1 766, SEQ ID NO: 1 767, SEQ ID NO: 1 768, SEQ ID 
NO: 1 769, SEQ ID NO: 1 770, SEQ ID NO: 1 77 1 , SEQ ID NO: 1 772, SEQ ID NO: 1 773, SEQ ID 
NO: 1 774, SEQ ID NO: 1 775, SEQ ID NO: 1 776, SEQ ID NO: 1 777, SEQ ID NO: 1 778, SEQ ID 
NO: 1 779, SEQ ID NO: 1 780, SEQ ID NO: 1781, SEQ ID NO: 1 782, SEQ ID NO: 1 783, SEQ ID 
NO: 1784, SEQ ID NO: 1785, SEQ ID NO: 1786, SEQ ID NO: 1787, SEQ ID NO: 1788, SEQ ID 
NO: 1 789, SEQ ID NO: 1 790, SEQ ID NO: 1 79 1 , SEQ ID NO: 1 792, SEQ ID NO: 1 793, SEQ ID 
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NO: 1 794, SEQ ID NO: 1 795, SEQ ID NO: 1 796, SEQ ID NO: 1 797, SEQ ID NO: 1 798, SEQ ID 
NO: 1 799, SEQ ID NO: 1 800, SEQ ID NO: 1 801 , SEQ ID NO: 1 802, SEQ ID NO: 1 803, SEQ ID 
NO: 1 804, SEQ ID NO: 1 805, SEQ ID NO: 1 806, SEQ ID NO: 1 807, SEQ ID NO: 1 808, SEQ ID 
NO: 1 809, SEQ ID NO: 1 8 1 0, SEQ ID NO: 1 8 1 1 , SEQ ID NO: 1 8 1 2, SEQ ID NO: 181 3, SEQ ID 
NO: 1 8 1 4, SEQ ID NO: 1 8 1 5, SEQ ID NO: 1 8 1 6, SEQ ID NO: 1 8 1 7, SEQ ID NO: 1 8 1 8, SEQ ID 
NO: 1 8 1 9, SEQ ID NO: 1 820, SEQ ID NO: 1 82 1 , SEQ ID NO: 1 822, SEQ ID NO: 1 823, SEQ ID 
NO: 1824, SEQ ID NO: 1825, SEQ ID NO: 1826, SEQ ID NO: 1827, SEQ ID NO: 1828, SEQ ID 
NO: 1 829, SEQ ID NO: 1 830, SEQ ID NO: 1 83 1 , SEQ ID NO: 1 832, SEQ ID NO: 1 833, SEQ ID 
NO: 1 834, SEQ ID NO: 1 835, SEQ ID NO: 1 836, SEQ ID NO: 1 837, SEQ ID NO: 1 838, SEQ ID 
NO: 1839, SEQ ID NO: 1840, SEQ ID NO: 1841, SEQ ID NO: 1842, SEQ ID NO: 1843, SEQ ID 
NO: 1844, SEQ ID NO: 1845, SEQ ID NO: 1846, SEQ ID NO: 1847, SEQ ID NO: 1848, SEQ ID 
NO: 1 849, SEQ ID NO: 1 850, SEQ ID NO: 1 85 1 , SEQ ID NO: 1 852, SEQ ID NO: 1 853, SEQ ID 
NO: 1854, SEQ ID NO: 1855, SEQ ID NO: 1856, SEQ ID NO: 1857, SEQ ID NO: 1858, SEQ ID 
NO: 1 859, SEQ ID NO: 1 860, SEQ ID NO: 1 86 1 , SEQ ID NO: 1 862, SEQ ID NO: 1 863, SEQ ID 
NO: 1 864, SEQ ID NO: 1 865, SEQ ID NO: 1 866, SEQ ID NO: 1 867, SEQ ID NO: 1 868, SEQ ID 
NO:1869, SEQ ID NO:1870, SEQ ID NO:1871, SEQ ID NO:1872, SEQ ID NO:1873, SEQ ID 
NO: 1 874, SEQ ID NO: 1 875, SEQ ID NO: 1 876, SEQ ID NO: 1 877, SEQ ID NO: 1 878, SEQ ID 
NO: 1 879, SEQ ID NO: 1 880, SEQ ID NO: 1 88 1 , SEQ ID NO: 1 882, SEQ ID NO: 1 883, SEQ ID 
NO: 1884, SEQ ID NO: 1885, SEQ ID NO: 1886, SEQ ID NO: 1887, SEQ ID NO: 1888, SEQ ID 
NO: 1889, SEQ ID NO: 1890, SEQ ID NO: 1891, SEQ ID NO: 1892, SEQ ID NO: 1893, SEQ ID 
NO: 1894, SEQ ID NO: 1895, SEQ ID NO: 1896, SEQ ID NO: 1897, SEQ ID NO: 1898, SEQ ID 
NO: 1899, SEQ ID NO: 1900, SEQ ID NO: 1901, SEQ ID NO: 1902, SEQ ID NO: 1903, SEQ ID 
NO:1904, SEQ ID NO:1905, SEQ ID NO:1906, SEQ ID NO:1907, SEQ ID NO:1908, SEQ ID 
NO:1909, SEQ ID NO:1910, SEQ ID NO:191 1, SEQ ID NO:1912, SEQ ID NO:1913, SEQ ID 
NO:1914, SEQ ID NO:1915, SEQ ID NO:1916, SEQ ID NO:1917, SEQ ID NO:1918, SEQ ID 
NO: 1919, SEQ ID NO: 1920, SEQ ID NO: 1921, SEQ ID NO: 1922, SEQ ID NO: 1923, SEQ ID 
NO: 1924, SEQ ID NO: 1925, SEQ ID NO: 1926, SEQ ID NO: 1927, SEQ ID NO: 1928, SEQ ID 
NO: 1929, SEQ ID NO: 1930, SEQ ID NO: 1931, SEQ ID NO: 1932, SEQ ID NO: 1933, SEQ ID 
NO: 1934, SEQ ID NO: 1935, SEQ ID NO: 1936, SEQ ID NO: 1937, SEQ ID NO: 1938, SEQ ID 
NO:1939, SEQ ID NO:1940, SEQ ID NO:1941, SEQ ID NO:1942, SEQ ID NO: 1943, SEQ ID 
NO: 1 944, SEQ ID NO: 1 945, SEQ ID NO: 1 946, SEQ ID NO: 1 947, SEQ ID NO: 1 948, SEQ ID 
NO: 1949, SEQ ID NO: 1950, SEQ ID NO: 1951, SEQ ID NO: 1952, SEQ ID NO: 1953, SEQ ID 
NO:1954, SEQ ID NO:1955, SEQ ID NO:1956, SEQ ID NO:1957, SEQ ID NO: 1958, SEQ ID 
NO:1959, SEQ ID NO:1960, SEQ ID NO:1961, SEQ ID NO:1962, SEQ ID NO:1963, SEQ ID 
NO: 1964, SEQ ID NO: 1965, SEQ ID NO: 1966, SEQ ID NO: 1967, SEQ ID NO: 1968, SEQ ID 
NO: 1969, SEQ ID NO: 1970, SEQ ID NO: 1971, SEQ ID NO: 1972, SEQ ID NO: 1973, SEQ ID 
NO: 1974, SEQ ID NO: 1975, SEQ ID NO: 1976, SEQ ID NO: 1977, SEQ ID NO: 1978, SEQ ID 
NO: 1979, SEQ ID NO: 1980, SEQ ID NO: 1981, SEQ ID NO: 1982, SEQ ID NO: 1983, SEQ ID 
NO: 1 984, SEQ ID NO: 1 985, SEQ ID NO: 1 986, SEQ ID NO: 1 987, SEQ ID NO: 1 988, SEQ ID 
NO: 1 989, SEQ ID NO: 1 990, SEQ ID NO: 1 99 1 , SEQ ID NO: 1 992, SEQ ID NO: 1 993, SEQ ID 
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NO: 1994, SEQ ID NO: 1995, SEQ ID NO: 1 996, SEQ ID NO: 1997, SEQ ID NO: 1998, SEQ ID 
NO: 1999, SEQ ID NO:2000, SEQ ID NO:2001, SEQ ID NO:2002, SEQ ID NO:2003, SEQ ID 
NO:2004, SEQ ID NO:2005, SEQ ID NO:2006, SEQ ID NO:2007, SEQ ID NO:2008, SEQ ID 
NO:2009, SEQ ID NO:2010, SEQ ID NO:201 1, SEQ ID NO.2012, SEQ ID NO.2013, SEQ ID 
NO-.2014, SEQ ID NO:2015, SEQ ID NO:2016, SEQ ID NO:2017, SEQ ID NO:2018. SEO ID 
NO:2019, SEQ ID NO:2020, SEQ ID NO:2021, SEQ ID NO:2022, SEQ ID NO:2023, SEQ ID 
NO:2024, SEQ ID NO:2025, SEQ ID NO:2026, SEQ ID NO:2027, SEQ ID NO:2028, SEQ ID 
NO:2029, SEQ ID NO:2030, SEQ ID NO.203 1, SEQ ID NO.2032, SEQ ID NO:2033, SEQ ID 
NO:2034, SEQ ID NO.2035, SEQ ID NO:2036, SEQ ID NO:2037, SEQ ID NO:2038, SEQ ID 
NO:2039, SEQ ID NO:2040, SEQ ID NO:2041, SEQ ID NO:2042, SEQ ID NO:2043, SEQ ID 
NO:2044, SEQ ID NO:2045, SEQ ID NO:2046, SEQ ID NO:2047, SEQ ID NO:2048, SEQ ID 
NO:2049, SEQ ID NO:2050, SEQ ID NO:2051, SEQ ID NO:2052, SEQ ID NO:2053, SEQ ID 
NO:2054, SEQ ID NO:2055, SEQ ID NO:2056, SEQ ID NO:2057, SEQ ID NO:2058, SEQ ID 
NO-.2059, SEQ ID NO:2060, SEQ ID NO.2061, SEQ ID NO:2062, SEQ ID NO:2063, SEQ ID 
NO:2064, SEQ ID NO:2065, SEQ ID NO:2066, SEQ ID NO:2067, SEQ ID NO:2068, SEQ ID 
NO:2069, SEQ ID NO:2070, SEQ ID NO:2071, SEQ ID NO:2072, SEQ ID NO:2073, SEQ ID 
NO:2074, SEQ ID NO:2075, SEQ ID NO.2076, SEQ ID NO.2077, SEQ ID NO.2078, SEQ ID 
NO.2079, SEQ ID NO:2080, SEQ ID NO.2081, SEQ ID NO:2082, SEQ ID NO:2083, SEQ ID 
NO:2084, SEQ ID NO:2085, SEQ ID NO:2086, SEQ ID NO:2087, SEQ ED NO:2088, SEQ ID 
NO:2089, SEQ ID NO:2090, SEQ ID NO:2091, SEQ ID NO:2092, SEQ ID NO:2093, SEQ ID 
NO:2094, SEQ ID NO:2095, SEQ ID NO:2096, SEQ ID NO:2097, SEQ ID NO:2098, SEQ ID 
NO:2099, SEQ ID NO.2100, SEQ ID NO.2101, SEQ ID NO:2102, SEQ ID NO:2103, SEQ ID 
NO:2104, SEQ IDNO:2105, SEQ ID NO:2106, SEQ ID NO:2107, SEQ ID NO:2108, SEQ ID 
NO:2109, SEQ ID NO:21 10, SEQ ID NO:21 1 1, SEQ ID NO:21 12, SEQ ID NO:21 13, SEQ ID 
NO:21 14, SEQ ID NO:21 15, SEQ ID NO:21 16, SEQ ID NO:21 17, SEQ ID NO:21 18, SEQ ID 
NO:21 19, SEQ ID NO:2120, SEQ ED NO:2121, SEQ ID NO.2122, SEQ ID NO.2123, SEQ ID 
NO:2124, SEQ ID NO.2125, SEQ ID NO:2126, SEQ ID NO.2127, SEQ ID NO.2128, SEQ ID 
NO:2129, SEQ ID NO:2130, SEQ ID NO:2131, SEQ ID NO:2132, SEQ ID NO:2133, SEQ ID 
NO:2134, SEQ ID NO:2135, SEQ ID NO:2136, SEQ ID NO:2137, SEQ ID NO:2138, SEQ ID 
NO:2139, SEQ ID NO:2140, SEQ ID NO:2141, SEQ ID NO.2142, SEQ ID NO.2143, SEQ ID 
NO:2144, SEQ ID NO:2145, SEQ ID NO:2146, SEQ ID NO:2147, SEQ ID NO:2148, SEQ ID 
NO:2149, SEQ ID NO.2150, SEQ ID NO:2151. Alternatively, the PCR uses corresponding probes 
consisting of nucleotide sequences selected from the group consisting of SEQ ID NO: 1327, SEQ ED 
NO: 1328, SEQ ID NO: 1329, SEQ ID NO: 1330, SEQ ED NO: 1331, SEQ ID NO. 1332, SEQ ID 
NO:1333, SEQ ID NO:1334, SEQ ID NO:1335, SEQ ID NO.1336, SEQ ED NO:1337, SEQ ID 
NO: 1338, SEQ ED NO: 1339, SEQ EO NO: 1340, SEQ ID NO: 1341, SEQ ID NO: 1342, SEQ ID 
NO: 1343, SEQ ID NO: 1344, SEQ ID NO: 1345, SEQ ID NO: 1346, SEQ ID NO: 1347, SEQ ID 
NO:1348, SEQ ID NO:1349, SEQ ID NO:1350, SEQ EO NO:1351, SEQ ED NO:1352, SEQ ID 
NO: 1353, SEQ ED NO: 1354, SEQ ID NO: 1355, SEQ ID NO: 1356, SEQ ID NO: 1357, SEQ ID 
NO: 1358, SEQ ID NO: 1 359, SEQ ID NO: 1 360, SEQ ID NO: 1 361, SEQ ID NO: 1 362, SEQ ID 
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NO: 1363, SEQ ID NO: 1364, SEQ ID NO: 1365, SEQ ID NO: 1366, SEQ ID NO: 1367, SEQ ID 
NO: 1 368, SEQ ID NO: 1 369, SEQ ID NO: 1 370, SEQ ID NO: 1371, SEQ ID NO: 1 372, SEQ ID 
NO: 1373, SEQ ID NO: 1374, SEQ ID NO: 1375, SEQ ID NO: 1376, SEQ ID NO: 1377, SEQ ID 
NO: 1378, SEQ ID NO: 1 379, SEQ ID NO: 1 380, SEQ ID NO: 1 38 1 , SEQ ID NO: 1 382, SEQ ID 
NO: 1383, SEQ ID NO: 1384, SEQ ID NO: 1385, SEQ ID NO: 1386, SEQ ID NO: 1387, SEQ ID 
NO: 1388, SEQ ID NO: 1 389, SEQ ID NO: 1 390, SEQ ID NO: 1 39 1 , SEQ ID NO: 1 392, SEQ ID 
NO: 1393, SEQ ID NO: 1 394, SEQ ID NO: 1395, SEQ ID NO: 1396, SEQ ID NO: 1397, SEQ ID 
NO: 1398, SEQ ID NO: 1399, SEQ ID NO: 1400, SEQ ID NO: 1401, SEQ ID NO: 1402, SEQ ID 
NO: 1403, SEQ ID NO: 1404, SEQ ID NO: 1405, SEQ ID NO: 1406, SEQ ID NO: 1407, SEQ ID 
NO: 1408, SEQ ID NO: 1 409, SEQ ID NO: 1410, SEQ ID NO: 1411, SEQ ID NO: 1412, SEQ ID 
NO:1413, SEQ ID NO:1414, SEQ ID NO:1415, SEQ ID NO:1416, SEQ ID NO:1417, SEQ ID 
NO: 14 18, SEQ ID NO: 14 19, SEQ ID NO: 1420, SEQ ID NO: 1421, SEQ ID NO: 1422, SEQ ID 
NO:1423, SEQ ID NO:1424, SEQ ID NO:1425, SEQ ID NO:1426, SEQ ID NO:1427, SEQ ID 
NO:1428, SEQ ID NO:1429, SEQ ID NO:1430, SEQ ID NO:1431, SEQ ID NO:1432, SEQ ID 
NO: 1433, SEQ ID NO: 1434, SEQ ID NO: 1435, SEQ ID NO: 1436, SEQ ID NO: 1437, SEQ ID 
NO:1438, SEQ ID NO:1439, SEQ ID NO:1440, SEQ ID NO:1441, SEQ ID NO: 1442, SEQ ID 
NO: 1443, SEQ ID NO:1444, SEQ ID NO:1445, SEQ ID NO: 1446, SEQ ID NO: 1447, SEQ ID 
NO: 1448, SEQ ID NO: 1449, SEQ ID NO: 1450, SEQ ID NO: 1451, SEQ ID NO: 1452, SEQ ID 
NO: 1454, SEQ ID NO: 1455, SEQ ID NO: 1456, SEQ ID NO: 1457, SEQ ID NO: 1458, SEQ ID 
NO:1459, SEQ ID NO:1460, SEQ ID NO:1461, SEQ ID NO:1462, SEQ ID NO:1463, SEQ ID 
NO: 1464, SEQ ID NO: 1465, SEQ ID NO: 1466, SEQ ID NO: 1467, SEQ ID NO: 1468, SEQ ID 
NO:1469, SEQ ID NO:1470, SEQ ID NO:1471, SEQ ID NO:1472, SEQ ID NO:1473, SEQ ID 
NO:1474, SEQ ID NO:1475, SEQ ID NO:1476, SEQ ID NO:1477, SEQ ID NO: 1478, SEQ ID 
NO:1479, SEQ ID NO:1480, SEQ ID NO:1481, SEQ ID NO:1482, SEQ ID NO:1483, SEQ ID 
NO: 1484, SEQ ID NO: 1485, SEQ ID NO: 1486, SEQ ID NO: 1487, SEQ ID NO: 1488, SEQ ID 
NO: 1489, SEQ ID NO: 1490, SEQ ID NO: 1491, SEQ ID NO: 1492, SEQ ID NO: 1493, SEQ ID 
NO: 1494, SEQ ID NO: 1495, SEQ ID NO: 1496, SEQ ID NO: 1497, SEQ ID NO: 1498, SEQ ID 
NO: 1499, SEQ ID NO: 1500, SEQ ID NO: 1501, SEQ ID NO: 1502, SEQ ID NO: 1503, SEQ ID 
NO:1504, SEQ ID NO:1505, SEQ ID NO:1506, SEQ ID NO: 1507, SEQ ID NO: 1508, SEQ ID 
NO: 1 509, SEQ ID NO: 1510, SEQ ID NO: 1511, SEQ ID NO: 1512, SEQ ID NO: 1 5 1 3, SEQ ID 
NO: 1 5 1 4, SEQ ID NO: 1515, SEQ ID NO: 1516, SEQ ID NO: 1517, SEQ ID NO: 1518, SEQ ID 
NO: 1519, SEQ ID NO: 1520, SEQ ID NO: 1521, SEQ ID NO: 1522, SEQ ID NO: 1523, SEQ ID 
NO:1524, SEQ ID NO:1525, SEQ ID NO:1526, SEQ ID NO:1527, SEQ ID NO:1528, SEQ ID 
NO: 1 529, SEQ ID NO: 1 530, SEQ ID NO: 1 53 1 , SEQ ID NO: 1 532, SEQ ID NO: 1 533, SEQ ID 
NO:1534, SEQ ID NO:1535, SEQ ID NO:1536, SEQ ID NO:1537, SEQ ID NO:1538, SEQ ID 
NO: 1539, SEQ ID NO: 1 540, SEQ ID NO: 1 54 1 , SEQ ID NO: 1 542, SEQ ID NO: 1 543, SEQ ED 
NO: 1544, SEQ ID NO: 1545, SEQ ID NO: 1546, SEQ ID NO: 1547, SEQ ID NO: 1548, SEQ ID 
NO: 1549, SEQ ID NO: 1550, SEQ ID NO: 1551, SEQ ID NO: 1552, SEQ ID NO: 1553, SEQ ID 
NO: 1554, SEQ ID NO: 1555, SEQ ID NO: 1556, SEQ ID NO: 1557, SEQ ID NO: 1558, SEQ ID 
NO: 1559, SEQ ID NO: 1560, SEQ ID NO: 1561, SEQ ID NO: 1562, SEQ ID NO: 1563, SEQ ID 
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NO: 1 564, SEQ ID NO: 1 565, SEQ ID NO: ! 566, SEQ ID NO: 1 567, SEQ ID NO: 1 568, SEQ ID 
NO: 1 569, SEQ ID NO: 1 570, SEQ ID NO: 1571, SEQ ID NO: 1 572, SEQ ID NO: 1 573, SEQ ID 
NO: 1 574, SEQ ID NO: 1 575, SEQ ID NO: 1 576, SEQ ID NO: 1 577, SEQ ID NO: 1 578, SEQ ID 
NO: 1 579, SEQ ID NO: 1 580, SEQ ID NO: 1 58 1 , SEQ ID NO: 1 582, SEQ ID NO: 1 583, SEQ ID 
NO: 1 584, SEQ ID NO: 1 585,. SEQ ID NO: 1 586, SEQ ID NO: 1 587, SEQ ID NO: 1 588, SEQ ID 
NO: 1589, SEQ ID NO: 1590, SEQ ID NO: 1 591, SEQ ID NO: 1592, SEQ ID NO: 1593, SEQ ID 
NO: 1594, SEQ ID NO:, SEQ ID NO: 1595, SEQ ID NO: 1596, SEQ ID NO: 1597, SEQ ID NO: 1598, 
SEQ ID NO:1599, SEQ ID NO:1600, SEQ ID NO:1601, SEQ ID NO:1602, SEQ ID NO:1603, SEQ ID 
NO:1604, SEQ ID NO:1605, SEQ ID NO:1606, SEQ ID NO:1607, SEQ ID NO:1608, SEQ ID 
NO:1609, SEQ ID NO:1610, SEQ ID NO:161 1, SEQ ID NO:1612, SEQ ID NO:1613, SEQ ID 
NO:1614, SEQ ID NO:1615, SEQ ID NO:1616, SEQ ID NO:1617, SEQ ID NO: 161 8, SEQ ID 
NO: 1619, SEQ ID NO: 1620, SEQ ID NO: 1 621, SEQ ID NO: 1622, SEQ ID NO: 1623, SEQ ID 
NO: 1624, SEQ ID NO: 1625, SEQ ID NO: 1626, SEQ ID NO: 1627, SEQ ID NO: 1628, SEQ ID 
NO: 1629, SEQ ID NO: 1630, SEQ ID NO: 1631, SEQ ID NO: 1632, SEQ ID NO: 1633, SEQ ID 
NO: 1634, SEQ ID NO: 1635, SEQ ID NO: 1636, SEQ ID NO: 1637, SEQ ID NO: 1638, SEQ ID 
NO:1639, SEQ ID NO:1640, SEQ ID NO:1641, SEQ ID NO:1642, SEQ ID NO:1643, SEQ ID 
NO: 1644, SEQ ID NO: 1645, SEQ ID NO: 1646, SEQ ID NO: 1647, SEQ ID NO: 1648, SEQ ED 
NO: 1 649, SEQ ID NO: 1 650, SEQ ID NO: 1 65 1 , SEQ ID NO: 1 652, SEQ ID NO: 1 653, SEQ ID 
NO: 1654, SEQ ID NO: 1655, SEQ ID NO: 1656, SEQ ID NO: 1657, SEQ ID NO:2152, SEQ ID NO:, 
SEQ ID NO:2153, SEQ ID NO:, SEQ ID NO:2154, SEQ ID NO:, SEQ ID NO:, SEQ ID NO:, SEQ ID 
NO:2145, SEQ ID NO:, SEQ ID NO:2156, SEQ ID NO:2157, SEQ ID NO:2158, SEQ ID NO:2159, 
SEQ ID NO:, SEQ ID NO:2160, SEQ ID NO:2161, SEQ ID NO:2162, SEQ ID NO:2163, SEQ ID 
NO:2164, SEQ ID NO:, SEQ ID NO:2165, SEQ ID NO:, SEQ ID NO:2166, SEQ ID NO:2167, SEQ 
ID NO:2168, SEQ ID NO:2169, SEQ ID NO:2170, SEQ ID NO:2171, SEQ ID NO:2172, SEQ ID 
NO:2173, SEQ ID NO:2174, SEQ ID NO:2175, SEQ ID NO:2176, SEQ ID NO:2177, SEQ ID 
NO:2178, SEQ IDNO:2179, SEQ ID NO:2180, SEQ ID NO:2181, SEQ ID NO:2182, SEQ ID 
NO:2183, SEQ ID NO:2184, SEQ ID NO:2185, SEQ ID NO:2186, SEQ ID NO:2187, SEQ ID 
NO:2188, SEQ ID NO:2189, SEQ ID NO:2190, SEQ ID NO:2191, SEQ ID NO:2192, SEQ ID 
NO:2193, SEQ ID NO:2194, SEQ ID NO:2195, SEQ ID NO:2196, SEQ ID NO:2197, SEQ ID 
NO:2198, SEQ ID NO:2199, SEQ ID NO:2200, SEQ ID NO:2201, SEQ ID NO:2202, SEQ ID 
NO:2203, SEQ ID NO:2204, SEQ ID NO:2205, SEQ ID NO:2206, SEQ ID NO:2207, SEQ ID 
NO:2208, SEQ ID NO:2209, SEQ ID NO:2210, SEQ ID NO:221 1, SEQ ID NO:2212, SEQ ID 
NO:2213, SEQ ID NO:2214, SEQ ID NO:2215, SEQ ID NO:2216, SEQ ID NO:2217, SEQ ID 
NO:2218, SEQ ID NO:2219, SEQ ID NO:2220, SEQ ID NO:2221, SEQ ID NO:2222, SEQ ID 
NO:2223, SEQ ID NO:2224, SEQ ID NO:2225, SEQ ID NO:2226, SEQ ID NO:2227, SEQ ID 
NO:2228, SEQ ID N0.2229, SEQ ID NO:2230, SEQ ID NO:2231, SEQ ID NO:2232, SEQ ID 
NO:2233, SEQ ID NO:2234, SEQ ID NO:2235, SEQ ID NO:2236, SEQ ID NO:2237, SEQ ID 
NO:2238, SEQ ID NO:2239, SEQ ID NO:2240, SEQ ID NO:2241, SEQ ID NO:2242, SEQ ID 
NO:2243, SEQ ID NO:2244, SEQ ID NO:2245, SEQ ID NO:2246, SEQ ID NO:2247, SEQ ID 
NO:2248, SEQ ID NO:2249, SEQ ID NO:2250, SEQ ID NO:2251, SEQ ID NO:2252, SEQ ID 
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NO:2253, SEQ ID NO:2254, SEQ ID NO:2255, SEQ ID NO:2256, SEQ ID NO:2257, SEQ ID 
NO:2258, SEQ ID NO:2259, SEQ ID NO:2260, SEQ ID NO:2261, SEQ ID NO:2262, SEQ ID 
NO:2263, SEQ ID NO:2264, SEQ ID NO:2265, SEQ ID NO:2266, SEQ ID NO:2267, SEQ ID 
NO:2268, SEQ ID NO:2269, SEQ ID NO:2270, SEQ ID NO:2271, SEQ ID NO:2272, SEQ ID 
NO:2273, SEQ ID NO:2274, SEQ ID NO:2275^SEQ ID NO:2276, SEQ ID NO:2277, SEQ ID 
NO:2278, SEQ ID NO:2279, SEQ ID NO:2280, SEQ ID NO:2281, SEQ ID NO:2282, SEQ ID 
NO:2283, SEQ ID NO:2284, SEQ ID NO:2285, SEQ ID NO:2286, SEQ ID NO:2287, SEQ ID 
NO:2288, SEQ ID NO:2289, SEQ ID NO:2290, SEQ ID NO:2291, SEQ ID NO:2292, SEQ ID 
NO:2293, SEQ ID NO.2294, SEQ ID NO:2295, SEQ ID NO:2296, SEQ ID NO:2297, SEQ ID 
NO:2298, SEQ ID N0.2299, SEQ ID NO:2300, SEQ ID NO:2301, SEQ ID NO:2302, SEQ ID 
NO:2303, SEQ ID NO:2304, SEQ ID NO:2305, SEQ ID NO:2306, SEQ ID NO:2307, SEQ ID 
NO:2308, SEQ ID NO:2309, SEQ ID NO:2310, SEQ ID NO:231 1, SEQ ID NO:2312, SEQ ID 
NO:2313, SEQ ID NO:2314, SEQ ID NO:2315, SEQ ID NO:2316, SEQ ID NO:2317, SEQ ID 
NO:2318, SEQ ID NO:2319, SEQ ED NO:2320, SEQ ID NO:2321, SEQ ID NO:2322, SEQ ID 
NO:2323, SEQ ID NO:2324, SEQ ID NO:2325, SEQ ID NO:2326, SEQ ID NO:2327, SEQ ID 
NO:2328, SEQ ID NO:2329, SEQ ID NO:2330, SEQ ID NO:2331, SEQ ID NO:2332, SEQ ID 
NO:2333, SEQ ID NO:2334, SEQ ID NO:2335, SEQ ID NO:2336, SEQ ID NO:2337, SEQ ID 
NO:2338, SEQ ID NO:2339, SEQ ID NO:2340, SEQ ID NO:2341, SEQ ID NO:2342, SEQ ID 
NO:2343, SEQ ID NO:2344, SEQ ID NO:2345, SEQ ID NO:2346, SEQ ID NO:2347, SEQ ID 
NO:2348, SEQ ID NO:2349, SEQ ID NO:2350, SEQ ID NO:2351, SEQ ID NO:2352, SEQ ID 
NO:2353, SEQ ID NO:2354, SEQ ID NO:2355, SEQ ID NO:2356, SEQ ID NO:2357, SEQ ID 
NO:2358, SEQ ID NO:2359, SEQ ID NO:2360, SEQ ID NO:2361, SEQ ID NO:2362, SEQ ID 
NO:2363, SEQ ID NO:2364, SEQ ID NO:2365, SEQ ED NO:2366, SEQ ID NO:2367, SEQ ID 
NO:2368, SEQ ID NO:2369, SEQ ID NO:2370, SEQ ID NO:2371, SEQ ID NO:2372, SEQ ID 
NO:2373, SEQ ID NO:2374, SEQ ID NO:2375, SEQ ID NO:2376, SEQ ID NO:2377, SEQ ID 
NO:2378, SEQ ID NO:2379, SEQ ID NO:2380, SEQ ID NO:2381, SEQ ID NO:2382, SEQ ID 
NO:2383, SEQ ID NO:2384, SEQ ID NO:2385, SEQ ID NO:2386, SEQ ID NO:2387, SEQ ID 
NO:2388, SEQ ID NO:2389, SEQ ID NO:2390, SEQ ID NO:2391, SEQ ID NO:2392, SEQ ID 
NO:2393, SEQ ID NO:2394, SEQ ID NO:2395, SEQ ID NO:2396, SEQ ID NO:2397, SEQ ID 
NO:2398, SEQ ID NO:2399. The RNA level may be detected by hybridization to the probes. In a 
further variation, the RNA level is detected by hybridization to an oligonucleotide. Examples of 
oligonucleotide include oligonucleotides having a nucleotide sequence selected from SEQ ID NO:2, 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID 
NO:9, SEQ ID NO: 1 0, SEQ ID NO: 1 1 , SEQ ID NO: 1 2, SEQ ID NO: 1 3, SEQ ID NO: 1 4, SEQ ID 
NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID 
NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID 
NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID 
NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID 
NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID 
NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID 
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NO:5 1 , SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID 
NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID N0:61, SEQ ID NO:62, SEQ ID 
NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID 
NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID 
NO:75, SEQ ID NO:76, SEQ ID NO: ,77,. SEQ ID NO;78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID 
NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID 
NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID 
NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID 
NO:99, SEQ ID NOrlOO, SEQ ID NO:101, SEQ IDNO:102, SEQ ID NO: 103, SEQ ID NO: 104, SEQ 
ID NO: 1 05, SEQ ID NO: 1 06, SEQ ID NO: 1 07, SEQ ID NO: 1 08, SEQ ID NO: 1 09, SEQ ID NO: 1 1 0, 
SEQ ID NO: 1 1 1 , SEQ ID NO: 1 1 2, SEQ ID NO: 1 1 3, SEQ ID NO: 1 14, SEQ ID NO: 1 1 5, SEQ ID 
NO:l 16, SEQ ID NO:l 17, SEQ ID NO:l 18, SEQ ID NO:l 19, SEQ ID NO: 120, SEQ ID NO: 121, SEQ 
ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, 
SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID 
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:I37, SEQ ID NO:138, SEQ 
ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO: 144, 
SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID 
NO: 1 50, SEQ ID NO: 151, SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ ID NO: 1 55, SEQ 
ID NO: 156, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 161, 
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID 
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ 
ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, 
SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID 
NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ ID NO: 1 89, SEQ 
ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 192, SEQ ID NO: 193, SEQ ID NO: 194, SEQ ID NO: 195, 
SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID 
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ 
ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, 
SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ 
ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, 
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID 
NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ 
ID NO:241, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:246, 
SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, SEQ ID NO:250, SEQ ID NO:251, SEQ ID 
NO:252, SEQ ID NO:253, SEQ ID NO:254, SEQ ID NO:255, SEQ ID NO:256, SEQ ID NO:257, SEQ 
ID NO:258, SEQ ID NO:259, SEQ ID NO:260, SEQ ID NO:261, SEQ ID NO:262, SEQ ID NO:263, 
SEQ ID NO:264, SEQ ID NO:265, SEQ ID NO:266, SEQ ID NO:267, SEQ ID NO:268, SEQ ID 
NO:269, SEQ ID.NO:270, SEQ ID NO:271, SEQ ID NO:272, SEQ ID NO:273, SEQ ID NO:274, SEQ 
ID NO:275, SEQ ID NO:276, SEQ ID NO:277, SEQ ID NO:278, SEQ ID NO:279, SEQ ID NO:280, 
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SEQ ID NO:281, SEQ ID NO:282, SEQ ID NO:283, SEQ ID NO:284, SEQ ID NO:285, SEQ ID 
NO:286, SEQ ID NO:287, SEQ ID NO:288, SEQ ID NO:289, SEQ ID NO:290, SEQ ID NO:291 , SEQ 
ID NO:292, SEQ ID NO:293, SEQ ID NO:294, SEQ ID NO:295, SEQ ID NO:296, SEQ ID NO:297, 
SEQ ID NO:298, SEQ ID NO:299, SEQ ID NO:300, SEQ ID NO:301 , SEQ ID NO:302, SEQ ID 
NO:303, SEQ ID NO:304, SEQ ID NO:305, SEQ ID NO:306, SEQ ID NO:307, SEQ ID NO:308, SEQ 
ID NO:309, SEQ ID NO:310, SEQ ID NO:31 1, SEQ ID NO:312, SEQ ID NO:313, SEQ ID NO:314, 
SEQ ID NO:315, SEQ ID NO:316, SEQ ID NO:3I7, SEQ ID NO:318, SEQ ID NO:319, SEQ ID 
NO:320, SEQ ID NO:321, SEQ ID NO:322, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:325, SEQ 
ID NO:326, SEQ ID NO:327, SEQ ID NO:328, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:331, 
SEQ ID NO:332, SEQ ID NO:2697, SEQ ID NO:2645, SEQ ID NO:2707, SEQ ID NO:2679, SEQ ID 
NO.2717, SEQ ID NO:2646, SEQ ID NO:2667, SEQ ID NO:2706, SEQ ID NO:2740, SEQ ID 
NO:2669, SEQ ID NO:2674, SEQ ID NO:2743, SEQ ID NO:2716, SEQ ID NO:2727, SEQ ID 
NO:2721, SEQ ID NO:2641, SEQ ID NO:2671, SEQ ID NO:2752, SEQ ID NO:2737, SEQ ID 
NO:2719, SEQ ID NO:2684, SEQ ID NO:2677, SEQ ID NO:2748, SEQ ID NO:2703, SEQ ID 
NO:271 1, SEQ ID NO:2663, SEQ ID NO:2657, SEQ ID NO:2683, SEQ ID NO:2686, SEQ ID 
NO:2687, SEQ ID NO:2644, SEQ ID NO:2664, SEQ ID NO:2747, SEQ ID NO:2744, SEQ ID 
NO:2678, SEQ ID NO.2731, SEQ ID NO:2713, SEQ ED NO:2736, SEQ ID NO:2708, SEQ ID 
NO:2670, SEQ ID NO:2661, SEQ ID NO:2680, SEQ ID NO:2754, SEQ ID NO:2728, SEQ ID 
NO:2742, SEQ ID NO:2668, SEQ ID NO:2750, SEQ ID NO:2746, SEQ ID NO:2738, SEQ ID 
NO:2627, SEQ ID NO.2739, SEQ ID NO:2647, SEQ ID NO:2628, SEQ ID NO:2638, SEQ ID 
NO:2725, SEQ ID NO:2714, SEQ ID NO:2635, SEQ ID NO:2751, SEQ ID NO:2629, SEQ ID 
NO:2695, SEQ ID NO:2741, SEQ ID NO:2691, SEQ ID NO:2726, SEQ ID NO:2722, SEQ ID 
NO:2689, SEQ ID NO:2734, SEQ ID NO:2631, SEQ ED NO:2656, SEQ ID NO:2696, SEQ ID 
NO:2676, SEQ ID NO:2701, SEQ ID NO:2730, SEQ ED NO:2710, SEQ ID NO:2632, SEQ ID 
NO:2724, SEQ ID NO:2698, SEQ ID NO:2662, SEQ ID NO:2753, SEQ ED NO:2704, SEQ ID 
NO:2675, SEQ ID NO:2700, SEQ ID NO:2640, SEQ ID NO:2723, SEQ ID NO:2658, SEQ ID 
NO:2688, SEQ ID NO:2735, SEQ ED NO:2702, SEQ ID NO:2681, SEQ ED NO:2755, SEQ ID 
NO:2715, SEQ ID NO:2732, SEQ ED NO:2652, SEQ ID NO:2651, SEQ ID NO:2718, SEQ ID 
NO:2673, SEQ ID NO:2733, SEQ ID NO:2712, SEQ ID NO:2659, SEQ ID NO:2654, SEQ ID 
NO:2636, SEQ ID NO.2639, SEQ ID NO:2690, SEQ ID NO:2705, SEQ ED NO:2685, SEQ ID 
NO:2692, SEQ ID NO:2693, SEQ ID NO:2648, SEQ ID NO:2650, SEQ ID NO:2720, SEQ ID 
NO:2660, SEQ ID NO:2666, SEQ ID NO:2699, SEQ ID NO:2633, SEQ ED NO:2672, SEQ ID 
NO:2642, SEQ ID NO:2682, SEQ ID NO:2655, SEQ ID NO:2630, SEQ ID NO:2745, SEQ ID 
NO:2643, SEQ ID NO:2694, SEQ ID NO:2749, SEQ ID NO:2665, SEQ ED NO:2649, SEQ ID 
NO:2637, SEQ ID NO:2634, SEQ ED NO:2709, SEQ ID NO:2653, SEQ ID NO:2729. In a further 
variation, the oligonucleotide has the nucleotide sequence SEQ ID NO: 36. In still a further variation, 
the oligonucleotide has the nucleotide sequence SEQ ID NO: 87. In yet a further variation, the 
oligonucleotide has the nucleotide sequence SEQ ID NO: 94. In an additional variation, the 
oligonucleotide has a nucleotide sequence consisting of SEQ ID NO: 91. In another variation, the 
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oligonucleotide has a nucleotide sequence consisting of SEQ ID NO: 107. The oligonucleotide may 
be DNA, RNA, cDNA, PNA, genomic DNA, or synthetic oligonucleotides. 

In another aspect, the methods of detecting transplant rejection include detecting the 
expression level by measuring one or more proteins expressed by the one or more genes. In one 
variation, the one or more proteins include.an amino acid sequence selected from SEQ ID NO.2400, 
SEQ ID NO:2401, SEQ ID NO:2402, SEQ ID NO:2403, SEQ ID NO:2404, SEQ ID NO:2405, SEQ ID 
NO:2407, SEQ ID NO:2408, SEQ ID NO:2409, SEQ ID NO:2410, SEQ ID NO:241 1, SEQ ID 
NO:2412, SEQ ID NO:2413, SEQ ID NO:2414, SEQ ID NO:2415, SEQ ID NO:2416, SEQ ID 
NO:2417, SEQ ID NO:2418, SEQ ID NO.2419, SEQ ID NO.2420, SEQ ID NO.2421, SEQ ID 
NO:2422, SEQ ID NO:2423, SEQ ID NO:2424, SEQ ID NO:2425, SEQ ID NO:2426, SEQ ID 
NO:2427, SEQ ID NO:2428, SEQ ID NO:2429, SEQ ID NO:2430, SEQ ID NO:2432, SEQ ID 
NO:2433, SEQ ID NO:2434, SEQ ID NO:2435, SEQ ID NO:2436, SEQ ID NO:2437, SEQ ID 
NO:2438, SEQ ID NO.2439, SEQ ID NO:2440, SEQ ID NO:2441, SEQ ID NO:2442, SEQ ID 
NO:2443, SEQ ID NO:2444, SEQ ID NO:2445, SEQ ID NO:2446, SEQ ID NO:2447, SEQ ID 
NO:2448, SEQ !DNO:2449, SEQ ID NO:2450, SEQ ID NO:2451, SEQ ID NO:2452, SEQ ID 
NO:2453, SEQ ID NO:2454, SEQ ID NO:2455, SEQ ID NO:2456, SEQ ID NO:2457, SEQ ID 
NO:2458, SEQ ID NO:2459, SEQ ID NO:2460, SEQ ID NO:2461, SEQ ID NO:2462, SEQ ID 
NO:2463, SEQ ID NO:2464, SEQ ID NO.2465, SEQ ID NO:2466, SEQ ID NO:2467, SEQ ID 
NO:2468, SEQ ID NO:2469, SEQ ID NO.2470, SEQ ID NO:2478, SEQ ID NO:2479, SEQ ID 
NO:2480, SEQ ID NO:2481, SEQ ID NO:2482, SEQ ID NO:2483, SEQ ID NO:2485, SEQ ID 
NO:2486, SEQ ID NO:2488, SEQ ID NO:2491, SEQ ID NO.2492, SEQ ID NO:2493, SEQ ID 
NO:2494, SEQ ID NO:2495, SEQ ID NO:2496, SEQ ID NO:2497, SEQ ID NO:2502, SEQ ID 
NO:2503, SEQ IDNO:2504, SEQ ID NO:2505, SEQ ID NO:2506, SEQ ID NO:2507, SEQ ID 
NO:2508, SEQ ID NO:2509, SEQ ID NO:2510, SEQ ID NO:251 1, SEQ ID NO:2512, SEQ ID 
NO:2513, SEQ ID NO:2514, SEQ ID NO:2515, SEQ ID NO:2516, SEQ ID NO:2517, SEQ ID 
NO-.2518, SEQ ID NO:2519, SEQ ID NO.2520, SEQ ID NO:2521, SEQ ID NO:2528, SEQ ID 
NO:2529, SEQ ID NO:2530, SEQ ID NO:2531, SEQ ID NO:2532, SEQ ID NO:2533, SEQ ID 
NO:2534, SEQ ID NO:2535, SEQ ID NO:2536, SEQ ID NO:2537, SEQ ID NO:2538, SEQ ID 
NO:2539, SEQ ID NO:2540, SEQ ID NO:2541, SEQ ID NO:2542, SEQ ID NO:2543, SEQ ID 
NO.2544, SEQ ID NO:2545, SEQ ID NO:2546, SEQ ID NO:2547, SEQ ID NO:2548, SEQ ID 
NO:2549, SEQ ID NO:2550, SEQ ID NO:255 1, SEQ ID NO:2552, SEQ ID NO:2553, SEQ ID 
NO:2554, SEQ ID NO:2555, SEQ ID NO:2556, SEQ ID NO:2557, SEQ ID NO:2558, SEQ ID 
NO:2559, SEQ ID NO:2560, SEQ ID NO:2561, SEQ ID NO:2562, SEQ ID NO:2563, SEQ ID 
NO:2564, SEQ ID NO:2565, SEQ ID NO.2566, SEQ ID NO-.2567, SEQ ID NO:2568, SEQ ID 
. MO.-2569, SEQ ID NO.2570, SEQ ID NO:257 1 , SEQ ID NO:2572, SEQ ID NO:2573, SEQ ID 
NO:2574, SEQ ID NO:2575, SEQ ID NO:2576, SEQ ID NO:2577, SEQ ID NO:2578, SEQ ID 
NO:2579, SEQ ID NO:2580, SEQ ID NO:2581, SEQ ID NO:2582, SEQ ID NO:2583, SEQ ID 
NO:2584, SEQ ID NO:2585, SEQ ID NO:2586, SEQ ID NO:2587, SEQ ID NO:2588, SEQ ID 
NO:2589, SEQ ID NO:2590, SEQ ID NO:2591, SEQ ID NO:2592, SEQ ID NO:2593, SEQ ID 
NO:2594, SEQ ID NO:2595, SEQ ID NO:2596, SEQ ID NO:2597, SEQ ID NO:2598, SEQ ID 
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NO.2599, SEQ ID NO:2600, SEQ ID NO:2601, SEQ ID NO:2602, SEQ ID NO:2603, SEQ ID 
NO:2604, SEQ ID NO:2605, SEQ ID NO:2606, SEQ ID NO:2607, SEQ ID NO:2608, SEQ ID 
NO:2609, SEQ ID NO:2610, SEQ ID NO:261 1, SEQ ID NO:2612, SEQ ID NO:2613, SEQ ID 
NO:2614, SEQ ID NO:2615, SEQ ID NO:2616, SEQ ID NO:2617, SEQ ID NO:2618, SEQ ID 
NO:2619, SEQ ID NO:2620, SEQ ID NO:2621, SEQ ID NO:2622, SEQ ID NO:2623, SEQ ID 
NO:2624, SEQ ID NO:2625, SEQ ID NO:2626, SEQ ID NO:2925, SEQ ID NO:2926, SEQ ID 
NO:2927, SEQ ID NO:2928, SEQ ID NO:2929, SEQ ID NO:2930, SEQ ID NO:2932, SEQ ID 
NO:2933, SEQ ID NO:2935, SEQ ID NO:2936, SEQ ID NO:2937, SEQ ID NO:2938, SEQ ID 
NO:2939, SEQ ID NO:2941, SEQ ID NO:2942, SEQ ID NO:2943, SEQ ID NO:2945, SEQ ID 
NO:2946, SEQ ID NO:2947, SEQ ID NO:2948, SEQ ID NO:2949, SEQ ID NO.2950, SEQ ID 
NO:2951, SEQ ID NO:2952, SEQ ID NO:2953, SEQ ID NO:2954, SEQ ID NO:2955, SEQ ID 
NO:2956, SEQ ID NO:2957, SEQ ID NO:2959, SEQ ID NO:2960, SEQ ID NO:2961, SEQ ID 
NO:2962, SEQ ID NO:2963, SEQ ID NO:2964, SEQ ID NO:2965, SEQ ID NO:2966, SEQ ID 
NO:2967, SEQ ID NO:2968, SEQ ID NO:2969, SEQ ID NO:2970, SEQ ID NO:2971, SEQ ID 
NO:2972, SEQ ID NO:2973, SEQ ID NO:2974, SEQ ID NO:2975, SEQ ID NO:2976, SEQ ID 
NO:2977, SEQ ID NO:2978, SEQ ID NO:2979, SEQ ID NO:2980, SEQ ID NO:2981, SEQ ID 
NO:2982, SEQ ID NO:2983, SEQ ID NO:2984, SEQ ID NO:2985, SEQ ID NO:2986, SEQ ID 
NO:2987, SEQ ID NO:2988, SEQ ID NO:2989, SEQ ID NO:2990, SEQ ID NO:2991, SEQ ID 
NO:2992, SEQ ID NO:2993, SEQ ID NO:2994, SEQ ID NO:2995, SEQ ID NO:2996, SEQ ID 
NO:2997, SEQ ID NO:2998, SEQ ID NO:2999, SEQ ID NO:3000, SEQ ID NO:3001, SEQ ID 
NO:3002, SEQ ID NO:3003, SEQ ID NO:3004, SEQ ID NO:3005, SEQ ID NO:3006, SEQ ID 
NO:3007, SEQ ID NO:3008, SEQ ID NO:3009, SEQ ID NO:3010, SEQ ID NO:301 1, SEQ ID 
NO:3012, SEQ ID NO:3013, SEQ ID NO:3014, SEQ ID NO:3015. In a further variation, the the 
method includes detecting one or more additional proteins expressed by SEQ ID NO:2406, SEQ ID 
NO:2431, SEQ ID NO.2471, SEQ ID NO:2472, SEQ ID NO:2473, SEQ ID NO:2474, SEQ ID 
NO:2475, SEQ ID NO:2476, SEQ ID NO:2477, SEQ ID NO:2484, SEQ ID NO:2487, SEQ ID 
NO:2489, SEQ ID NO:2490, SEQ ID NO:2498, SEQ ID NO:2499, SEQ ID NO:2500, SEQ ID 
NO:2501, SEQ ID NO:2522, SEQ ID NO:2523, SEQ ID NO:2524, SEQ ID NO:2525, SEQ ID 
NO:2526, SEQ ID NO:2527. In still another variation, one or more proteins may be selected from 
SEQ ID NO:2400, SEQ ID NO:2401, SEQ ID NO:2402, SEQ ID NO:2403, SEQ ID NO:2404, SEQ ID 
NO:2405, SEQ ID NO:2407, SEQ ID NO:2408, SEQ ID NO.2409, SEQ ID NO:2410, SEQ ID 
NO:241 1, SEQ ID NO:2412, SEQ ID NO:2413, SEQ ID NO:2414, SEQ ID NO:2415, SEQ ID 
NO:2416, SEQ ID NO:2417, SEQ ID NO:2418, SEQ ID NO:2419, SEQ ID NO:2420, SEQ ID 
NO:2421, SEQ ID NO:2422, SEQ ID NO:2423, SEQ ID NO:2424, SEQ ID NO:2425, SEQ ID 
NO:2426, SEQ ID NO:2427, SEQ ID NO.2428, SEQ ID NO:2429, SEQ ID NO:2430, SEQ ID 
NO:2432, SEQ ID NO:2433, SEQ ID NO:2434, SEQ ID NO:2435, SEQ ID NO:2436, SEQ ID 
NO:2437, SEQ ID NO:2438, SEQ ID NO:2439, SEQ ID NO:2440, SEQ ID NO:2441, SEQ ID 
NO:2442, SEQ ID NO:2443, SEQ ID NO:2444, SEQ ID NO:2445, SEQ ID NO:2446, SEQ ID 
NO:2447, SEQ ID NO:2448, SEQ ID NO:2449, SEQ ID NO: 2450, SEQ ID NO:2451, SEQ ID 
NO:2452, SEQ ID NO:2453, SEQ ID NO:2454, SEQ ID NO:2455, SEQ ID NO:2456, SEQ ID 
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NO:2457, SEQ ID NO:2458, SEQ ID NO.2459, SEQ ID NO.2460, SEQ ID NO:2461, SEQ ID 
NO:2462, SEQ ID NO.2463, SEQ ID NO:2464, SEQ ID NO:246S, SEQ ID NO:2466, SEQ ID 
NO:2467, SEQ ID NO:2468, SEQ ID NO:2469, SEQ ID NO:2470, SEQ ID NO:2478, SEQ ID 
NO:2479, SEQ ID NO:2480, SEQ ID NO:2481, SEQ ID NO:2482, SEQ ID NO:2483, SEQ ID 
NO:2485, SEQ ID NO:2486, SEQ ID NO:2488, SEQ ID NO.2491, SEQ ID NO.2492, SEQ ID 
NO:2493, SEQ ID NO:2494, SEQ ID NO.2495, SEQ ID NO:2496, SEQ ID NO:2497, SEQ ID 
NO:2502, SEQ ID NO:2503, SEQ ID NO:2504, SEQ ID NO:2505, SEQ ID NO:2506, SEQ ID 
NO:2507, SEQ ID NO:2508, SEQ ID NO:2509, SEQ ID NO:2510, SEQ ID NO:251 1, SEQ ID 
NO:2512, SEQ ID NO:2513, SEQ ID NO:2514, SEQ ID NO:2515, SEQ ID NO.2516, SEQ ID 
NO:2517, SEQ ID NO.2518, SEQ ID NO:2519, SEQ ID NO:2520, SEQ ID NO:2521, SEQ ID 
NO-.2528, SEQ ID NO:2529, SEQ ID NO.2530, SEQ ID NO:2531, SEQ ID NO:2532, SEQ ID 
NO:2533, SEQ ID NO:2534, SEQ ID NO:2535, SEQ ID NO:2536, SEQ ID NO:2537, SEQ ID 
NO:2538, SEQ ID NO:2539, SEQ ID NO:2540, SEQ ID NO:2541, SEQ ID NO:2542, SEQ ID 
NO:2543, SEQ ID NO:2544, SEQ ID NO:2545, SEQ ID NO:2546, SEQ ID NO:2547, SEQ ID 
NO:2548, SEQ ID NO:2549, SEQ ID NO:2550, SEQ ED NO:2551, SEQ ID NO:2552, SEQ ID 
NO:2553, SEQ ID NO.2554, SEQ ID NO:2555, SEQ ID NO:2556, SEQ ID NO:2557, SEQ ID 
NO:2558, SEQ ID NO:2559, SEQ ID NO:2560, SEQ ID NO:2561, SEQ ID NO.2562, SEQ ID 
NO.-2563, SEQ ID NO:2S64, SEQ ID NO:2565, SEQ ID NO.2566, SEQ ID NO:2567, SEQ ID 
NO:2568, SEQ ID NO:2569, SEQ ID NO:2570, SEQ ID NO:2571, SEQ ID NO:2572, SEQ ID 
NO:2573, SEQ ID NO:2574, SEQ ID NO:2575, SEQ ID NO:2576, SEQ ID NO:2577, SEQ ID 
NO:2578, SEQ ID NO:2579, SEQ ID NO:2580, SEQ ID NO:2581, SEQ ID NO:2S82, SEQ ID 
NO:2583, SEQ ID NO.2584, SEQ ID NO:2585, SEQ ID NO:2586, SEQ ID NO:2587, SEQ ID 
NO.2588, SEQ ID NO:2589, SEQ ID NO:2590, SEQ ID NO:2591, SEQ ID NO:2592, SEQ ID 
NO:2593, SEQ ID NO:2594, SEQ ID NO:2595, SEQ ID NO:2596, SEQ ID NO:2597, SEQ ID 
NO:2598, SEQ ID NO:2599, SEQ ID NO:2600, SEQ ID NO:2601, SEQ ID NO:2602, SEQ ID 
NO:2603, SEQ ID NO.2604, SEQ ID NO.2605, SEQ ID NO.2606, SEQ ID NO:2607, SEQ ID 
NO.2608, SEQ ID NO:2609, SEQ IDNO:2610, SEQ ID NO:261 1, SEQ ID NO:2612, SEQ ID 
NO:2613, SEQ ID NO:2614, SEQ ID NO:2615, SEQ ID NO:2616, SEQ ID NO:2617, SEQ ID 
NO:2618, SEQ ID NO:2619, SEQ ID NO:2620, SEQ ID NO:2621, SEQ ID NO:2622, SEQ ID 
NO:2623, SEQ ID NO:2624, SEQ ID NO:2625, SEQ ID NO:2626, SEQ ID NO:2925, SEQ ID 
NO:2926, SEQ ID NO:2927, SEQ ID NO:2928, SEQ ID NO:2929, SEQ ID NO:2930, SEQ ID 
NO:2932, SEQ ID NO:2933, SEQ ID NO:2935, SEQ ID NO.2936, SEQ ID NO:2937, SEQ ID 
NO:2938, SEQ ID NO:2939, SEQ ID NO:2941, SEQ ID NO:2942, SEQ ID NO:2943, SEQ ID 
NO.2945, SEQ ID NO:2946, SEQ ID NO:2947, SEQ ID NO:2948, SEQ ID NO:2949, SEQ ID 
NO:2950, SEQ ID NO:2951, SEQ ID NO:2952, SEQ ID NO:2953, SEQ ID NO:2954, SEQ ID 
NO:2955, SEQ ID NO:2956, SEQ ID NO.-2957, SEQ ID NO:2959, SEQ ID NO:2960, SEQ ID 
NO:2961, SEQ ID NO:2962; SEQ ID NO:2963, SEQ ID NO:2964, SEQ ID NO:2965, SEQ ID 
NO.-2966, SEQ ID NO:2967, SEQ ID NO:2968, SEQ ID NO.2969, SEQ ID NO.2970, SEQ ID 
NO:2971, SEQ ID NO:2972, SEQ ID NO:2973, SEQ ID NO:2974, SEQ ID NO:2975, SEQ ID 
NO:2976, SEQ ID NO:2977, SEQ ID NO:2978, SEQ ID NO.-2979, SEQ ID NO:2980, SEQ ID 
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NO:2981, SEQ ID NO:2982, SEQ ID NO:2983, SEQ ID NO:2984, SEQ IDNO:2985, SEQ ID 
NO:2986, SEQ ID NO:2987, SEQ ID NO:2988, SEQ ID NO:2989, SEQ ID NO:2990, SEQ ID 
NO:2991, SEQ ID NO:2992, SEQ ID NO:2993, SEQ ID NO:2994, SEQ ID NO:2995, SEQ ID 
NO:2996, SEQ ID NO:2997, SEQ ID NO:2998, SEQ ID NO:2999, SEQ ID NO.3000, SEQ ID 
NO:3001, SEQ ID NO:3002, SEQ ID NO:3003, SEQ ID NO:3004, SEQ ID NO:3005, SEQ ID 
NO:3006, SEQ ID NO:3007, SEQ ID NO:3008, SEQ ID NO:3009, SEQ ID NO:3010, SEQ ID 
NO:301 1, SEQ ID NO:3012, SEQ ID NO:3013, SEQ ID NO:3014, SEQ ID NO:3015, and one or more 
proteins may be selected from SEQ ID NO:2406, SEQ ID NO:2431, SEQ ID NO:247I, SEQ ID 
NO:2472, SEQ ID NO:2473, SEQ ID NO:2474, SEQ ID NO:2475, SEQ ID NO:2476, SEQ ID 
NO:2477, SEQ ID NO:2484, SEQ ID NO:2487, SEQ ID NO:2489, SEQ ID NO:2490, SEQ ID 
NO:2498, SEQ ID NO:2499, SEQ ID NO:2500, SEQ ID NO:2501, SEQ ID NO:2522, SEQ ID 
NO:2523, SEQ ID NO:2524, SEQ ID NO:2525, SEQ ID NO:2526 > SEQ ID NO:2527. 

In another aspect, the method of diagnosing or monitoring cardiac transplant rejection in a 
patient includes detecting the expression level of one or more genes in the patient to diagnose or 
monitor cardiac transplant rejection in the patient by measuring one or more proteins expressed by the 
one or more genes. The one or more proteins may include an amino acid sequence selected from SEQ 
ID NO.2400, SEQ ID 1*0:2401, SEQ ID NO:2402, SEQ ID NO.2403, SEQ ID NO:2404, SEQ ID 
NO.2405, SEQ ID NO.2407, SEQ ID NO:2408, SEQ ID NO:2409, SEQ ID NO:2410, SEQ ID 
NO:241 1, SEQ ID NO:2412, SEQ ID NO:2413, SEQ ID NO:2414, SEQ ID NO:2415, SEQ ID 
NO:2416, SEQ ID NO:2417, SEQ ID NO.2418, SEQ ID NO:2419, SEQ ID NO:2420, SEQ ID 
NO:2421, SEQ ID NO.2422, SEQ ID NO:2423, SEQ ID NO:2424, SEQ ID NO:2425, SEQ ID 
NO:2426, SEQ ID NO:2427, SEQ ID NO:2428, SEQ ID NO:2429, SEQ ID NO:2430, SEQ ID 
NO:2432, SEQ ID NO:2433, SEQ ID NO:2434, SEQ ID NO:2435, SEQ ID NO:2436, SEQ ID 
NO:2437, SEQ ID NO:2438, SEQ ID NO:2439, SEQ ID NO:2440, SEQ ID NO:2441, SEQ ID 
NO:2442, SEQ ID NO:2443, SEQ ID NO:2444, SEQ ID NO:2445, SEQ ID NO:2446, SEQ ID 
NO:2447, SEQ ID NO:2448, SEQ ID NO:2449, SEQ ID NO:2450, SEQ ID NO:2451, SEQ ID 
NO:2452, SEQ ID NO:2453, SEQ ID NO:2454, SEQ ID NO:2455, SEQ ID NO:2456, SEQ ID 
NO:2457, SEQ ID NO:2458, SEQ ID NO:2459, SEQ ID NO:2460, SEQ ID NO:2461, SEQ ID 
NO:2462, SEQ ID NO:2463, SEQ ID NO:2464, SEQ ID NO:2465, SEQ ID NO:2466, SEQ ID 
NO:2467, SEQ ID NO:2468, SEQ ID NO:2469, SEQ ID NO:2470, SEQ ID NO:2471, SEQ ID 
NO:2476, SEQ ID NO:2477, SEQ ID NO:2478, SEQ ID NO:2479, SEQ ID NO:2480, SEQ ID 
NO:2481, SEQ ID NO:2482, SEQ ID NO:2483, SEQ ID NO:2484, SEQ ID NO:2485, SEQ ID 
NO.2486, SEQ ID NO:2488, SEQ ID NO:2489, SEQ ID NO:2490, SEQ ID NO:2491, SEQ ID 
NO:2492, SEQ ID NO:2493, SEQ ID NO.2494, SEQ ID NO:2495, SEQ ID NO:2496, SEQ ID 
NO:2497, SEQ ID NO:2498, SEQ ID NO:2499, SEQ ID NO:2500, SEQ ID NO:2501, SEQ ID 
NO:2502, SEQ ID NO:2503, SEQ ID NO:2504, SEQ ID NO:2505, SEQ ID NO.2506, SEQ ID 
NO:2507, SEQ ID NO:2508, SEQ ID NO:2509, SEQ ID NO:2510, SEQ ID NO:251 1, SEQ ID 
NO:2512, SEQ ID NO:2513, SEQ IDNO:2514, SEQ ID NO:2515, SEQ IDNO:2516, SEQ ID 
NO:2517, SEQ ID NO:2518, SEQ IDNO:2519, SEQ ID NO:2520, SEQ IDNO:2521, SEQ ID 
NO:2528, SEQ ID NO:2529, SEQ ID NO:2530, SEQ ID NO:2531, SEQ ID NO:2532, SEQ ID 
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NO:2533, SEQ ID NO:2534, SEQ ID NO:2535, SEQ ID NO:2536, SEQ ID NO:2537, SEQ ID 
NO:2538, SEQ ID NO:2539, SEQ IDNO:2540, SEQ ID NO:2541, SEQ ID NO:2542, SEQ ID 
NO:2543, SEQ ID NO:2544, SEQ ID NO:2545, SEQ ID NO:2546, SEQ ID NO:2547, SEQ ID 
NO:2548, SEQ ID NO:2549, SEQ ID NO:2550, SEQ ID NO:2551, SEQ ID NO:2552, SEQ ID 
NO:2553, SEQ ID NO:2554, SEQ ID NO:2555, SEQ ID NO.2556, SEQ ID NO:2557, SEQ ID 
NO:2558, SEQ ID NO:2559, SEQ ID NO:2560, SEQ ID NO:2561, SEQ ID NO:2562, SEQ ID 
NO:2563, SEQ ID NO:2564, SEQ ID NO:2565, SEQ ID NO:2566, SEQ ID NO.2567, SEQ ID 
NO:2568, SEQ ID NO:2569, SEQ ID NO:2570, SEQ ID NO:2571, SEQ ID NO:2572, SEQ ID 
NO:2573, SEQ ID NO:2574, SEQ ID NO:2575, SEQ ID NO:2576, SEQ ID NO:2577, SEQ ID 
NO:2578, SEQ ID NO:2579, SEQ ID NO:2580, SEQ ID NO:2581, SEQ ID NO:2582, SEQ ID 
NO:2583, SEQ ID NO:2584, SEQ ID NO:2585, SEQ ID NO:2586, SEQ ID NO:2587, SEQ ID 
NO:2588, SEQ ID NO:2589, SEQ ID NO:2590, SEQ ID NO:2591, SEQ ID NO:2592, SEQ ID 
NO:2593, SEQ ID NO:2594, SEQ ID NO:2595, SEQ ID NO:2596, SEQ ID NO:2597, SEQ ID 
NO:2598, SEQ ID NO:2599, SEQ ID NO:2600, SEQ ID NO:2601, SEQ ID NO:2602, SEQ ID 
NO:2603, SEQ ID NO:2604, SEQ ID NO:2605, SEQ ID NO.2606, SEQ ID NO:2607, SEQ ID 
NO:2608, SEQ ID NO:2609, SEQ ID NO:2610, SEQ ID NO:261 1, SEQ ID NO:2612, SEQ ID 
NO:2613, SEQ ID NO:2614, SEQ ID NO:2615, SEQ ID NO.2616, SEQ ID NO:2617, SEQ ID 
NO:2618, SEQ ID NO:2619, SEQ ID NO:2620, SEQ ID NO:2621, SEQ ID NO:2622, SEQ ID 
NO:2623, SEQ ID NO:2624, SEQ ID NO:2625, SEQ ID NO:2626. Alternatively, the expression level 
of the one or more genes may be detected by measuring one or more proteins expressed by one or more 
genes, and one or more proteins expressed by one or more additional genes. In one variation, the one 
or more proteins expressed by the one or more genes include an amino acid sequence selected from 
SEQ ID NO:2400, SEQ ID NO:2401, SEQ ID NO:2402, SEQ ID NO:2403, SEQ ID NO:2404, SEQ ED 
NO:2405, SEQ ID NO:2407, SEQ ID NO:2408, SEQ ID NO:2409, SEQ ID NO:2410, SEQ ID 
NO:241 1, SEQ ID NO:2412, SEQ ID NO:2413, SEQ ID NO:2414, SEQ ID NO:2415, SEQ ID 
NO:2416, SEQ ID NO:2417, SEQ ID NO:2418, SEQ ID NO:2419, SEQ ID NO:2420, SEQ ID 
NO:2421, SEQ ID NO:2422, SEQ ID NO:2423, SEQ ID NO:2424, SEQ ID NO:2425, SEQ ID 
NO:2426, SEQ ID NO:2427, SEQ ID NO:2428, SEQ ID NO:2429, SEQ ID NO:2430, SEQ ID 
NO:2432, SEQ ID NO:2433, SEQ ID NO:2434, SEQ ID NO:2435, SEQ ID NO:2436, SEQ ID 
NO:2437, SEQ ID NO:2438, SEQ ID NO:2439, SEQ ID NO:2440, SEQ ID NO:2441, SEQ ID 
NO:2442, SEQ ID NO:2443, SEQ ID NO:2444, SEQ ID NO:2445, SEQ ID NO:2446, SEQ ID 
NO:2447, SEQ ID NO:2448, SEQ ID NO:2449, SEQ ID NO:2450, SEQ ID NO:2451, SEQ ID 
NO:2452, SEQ ID NO:2453, SEQ ID NO:2454, SEQ ID NO:2455, SEQ ID NO:2456, SEQ ID 
NO:2457, SEQ ID NO:2458, SEQ ID NO:2459, SEQ ID NO:2460, SEQ ID NO:2461, SEQ ID 
NO:2462, SEQ ID NO:2463, SEQ ID NO:2464, SEQ ID NO:2465, SEQ ID NO:2466, SEQ ID 
NO:2467, SEQ ID NO:2468, SEQ ID NO:2469, SEQ ID NO:2470, SEQ ID NO:2471, SEQ ED 
NO:2476, SEQ ID NO:2477, SEQ ID NO:2478, SEQ ID NO:2479, SEQ ID NO:2480, SEQ ID 
NO:2481, SEQ ID NO:2482, SEQ ID NO:2483, SEQ ID NO:2484, SEQ ID NO:2485, SEQ ID 
NO:2486, SEQ ID NO:2488, SEQ ID NO:2489, SEQ ID NO:2490, SEQ ID NO:2491, SEQ ID 
NO:2492, SEQ ID NO:2493, SEQ ID NO:2494, SEQ ID NO:2495, SEQ ID NO:2496, SEQ ID 
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NO:2497, SEQ ID NO.2498, SEQ ID NO:2499, SEQ ID NO:2500, SEQ ID NO:2501, SEQ ID 
NO:2502, SEQ ID NO.2503, SEQ ID NO2504, SEQ ID NO2505, SEQ ID NO:2506, SEQ ID 
NO:2507, SEQ ID NO:2508, SEQ ID NO:2509, SEQ ID NO:2510, SEQ ID NO:251 1, SEQ ID 
NO:2512, SEQ ID NO:2513, SEQ ID NO:25l4, SEQ ID NO:2515, SEQ ID NO:2516, SEQ ID 
NO.2517, SEQ ID NO:2518, SEQ ID NO:2519, SEQ ID NO:2520, SEQ ID NO:2521, SEQ ID 
NO:2528, SEQ ID NO:2529, SEQ ID NO:2530, SEQ ID NO:2531, SEQ ID NO:2532, SEQ ID 
NO:2533, SEQ ID NO:2534, SEQ ID NO:2535, SEQ ID NO:2536, SEQ ID NO:2537, SEQ ID 
NO:2538, SEQ ID NO:2539, SEQ ID NO:2540, SEQ ID NO:2541, SEQ ID NO:2542, SEQ ID 
NO:2543, SEQ ID NO:2544, SEQ ID NO:2545, SEQ ID NO:2546, SEQ ID NO:2547, SEQ ID 
NO:2548, SEQ ID NO:2549, SEQ ID NO:2550, SEQ ID NO.2551, SEQ ID NO:2552, SEQ ID 
NO.2553, SEQ ID NO:2554, SEQ ID NO:2555, SEQ ID NO:2556, SEQ ID NO:2557, SEQ ID 
NO:2558, SEQ ID NO:2559, SEQ ID NO:2560, SEQ ID NO:2561, SEQ ID NO:2562, SEQ ID 
NO:2563, SEQ ID NO.2564, SEQ ID NO:2565, SEQ ID NO:2566, SEQ ID NO:2567, SEQ ID 
NO:2568, SEQ ID NO:2569, SEQ ID NO.2570, SEQ ID NO:2571, SEQ ID NO:2572, SEQ ID 
NO:2573, SEQ ID NO:2574, SEQ ID NO:2575, SEQ ID NO:2576, SEQ ID NO:2577, SEQ ID 
NO:2578, SEQ ID NO:2579, SEQ ID NO:2580, SEQ ID NO:258I, SEQ ID NO:2582, SEQ ID 
NO:2583, SEQ ID NO:2584, SEQ ID NO:2585, SEQ ID NO:2586, SEQ ID NO:2587, SEQ ID 
NO.2588, SEQ ID NO:2589, SEQ ID NO:2590, SEQ ID NO:259l, SEQ ID NO:2592, SEQ ID 
NO:2593, SEQ ID NO:2594, SEQ ID NO:2595, SEQ ID NO:2596, SEQ ID NO:2597, SEQ ID 
NO:2598, SEQ ID NO:2599, SEQ ID NO:2600, SEQ ID NO:2601, SEQ ID NO:2602, SEQ ID 
NO:2603, SEQ ID NO:2604, SEQ ID NO:2605, SEQ ID NO:2606, SEQ ID NO:2607, SEQ ID 
NO.2608, SEQ ID NO:2609, SEQ ID NO:2610, SEQ ID NO.261 1, SEQ ID NO:2612, SEQ ID 
NO:2613, SEQ ID NO:2614, SEQ ID NO:2615, SEQ ID NO:2616, SEQ ID NO:2617, SEQ ID 
NO.2618, SEQ ID NO:2619, SEQ ID NO:2620, SEQ ID NO:2621, SEQ ID NO:2622, SEQ ID 
NO:2623, SEQ ID NO:2624, SEQ ID NO:2625, SEQ ID NO:2626, and the one or more protein 
expressed by the one or more additional genes include an amino acid sequence selected from the group 
consisting of SEQ ID NO.2406, SEQ ID NO:2431, SEQ ID NO:2472, SEQ ID NO:2473, SEQ ID 
NO:2474, SEQ ID NO:2475, SEQ ID NO:2487, SEQ ID NO:2522, SEQ ID NO:2523, SEQ ID 
NO:2524, SEQ ID NO:2525, SEQ ID NO:2526, SEQ ID NO:2527. 

In another aspect, the method of diagnosing or monitoring kidney transplant rejection in a 
patient includes detecting the expression level of one or more genes in the patient to diagnose or 
monitor kidney transplant rejection in the patient by measuring one or more proteins encoded by the 
one or more genes. In one variation, the one or more proteins include an amino acid sequence selected 
from SEQ ID NO:2400, SEQ ID NO:2401, SEQ ID NO.2402, SEQ ID NO.2403, SEQ ID NO.2404, 
SEQ ID NO:2405, SEQ ID NO:2406, SEQ ID NO:2407, SEQ ID NO:2408, SEQ ID NO:2409, SEQ ID 
NO:2410, SEQ IDNO:241 1, SEQ ID NO:2412, SEQ ID NO:2413, SEQ ID NO:2414, SEQ ID 
NO:2415, SEQ ED NO:2416, SEQ ID NO:2417, SEQ ID N0.2418, SEQ ID NO:2419, SEQ ID 
NO:2420, SEQ ID NO:2421, SEQ ID NO:2422, SEQ ID NO:2423, SEQ ID NO:2424, SEQ ID 
NO:2425, SEQ ID NO:2426, SEQ ID NO:2427, SEQ ID NO:2428, SEQ ID NO:2429, SEQ ID 
NO:243Q, SEQ ID NO:2432, SEQ ID NO:2433, SEQ ID NO:2434, SEQ ID NO:2435, SEQ ID 
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NO:2436, SEQ ID NO:2437, SEQ ID NO.2438, SEQ ID NO.2439, SEQ ID NO:2440, SEQ ID 
NO:2441, SEQ ID NO.2442, SEQ ID NO:2443, SEQ ID NO:2444, SEQ ID NO:2445, SEQ ID 
NO:2446, SEQ ID NO:2447, SEQ ID NO:2448, SEQ ID NO:2449, SEQ ID NO:2450, SEQ ID 
NO:2451, SEQ ID NO:2452, SEQ ID NO:2453, SEQ ID NO:2454, SEQ ID NO:2455, SEQ ID 
NO:2456, SEQ ID .NO:2457, SEQ ID NO:2458, SEQ ID NO:2459, SEQ ID NO:2460, SEQ ID 
NO:2461, SEQ ID NO:2462, SEQ ID NO:2463, SEQ ID NO:2464, SEQ ID NO:2465, SEQ ID 
NO:2466, SEQ ID NO:2467, SEQ ID NO:2468, SEQ ID NO:2469, SEQ ID NO:2470, SEQ ID 
NO:2474, SEQ ID NO:2478, SEQ ID NO:2479, SEQ ID NO:2480, SEQ ID NO:2481, SEQ ID 
NO:2482, SEQ ID NO:2483, SEQ ID NO:2485, SEQ ID NO:2486, SEQ ID NO:2487, SEQ ID 
NO:2488, SEQ ID NO:2491, SEQ ID NO:2492, SEQ ID NO:2493, SEQ ID NO:2494, SEQ ID 
NO:2495, SEQ ID NO:2496, SEQ ID NO:2497, SEQ ID NO:2502, SEQ ID NO:2503, SEQ ID 
NO:2504, SEQ ID NO:2505, SEQ ID NO:2506, SEQ ED NO:2507, SEQ ID NO:2508, SEQ ID 
NO:2509, SEQ ID NO:2510, SEQ ID NO:25l 1, SEQ ID NO-.2512, SEQ ID NO:2513, SEQ ID 
NO:2514, SEQ ID NO:2515, SEQ ID NO:2516, SEQ ID NO:2517, SEQ ID NO:2518, SEQ ID 
NO:2519, SEQ ID NO:2520, SEQ ID NO:2521, SEQ ID NO.-2528, SEQ ID NO:2529, SEQ ID 
NO.-2530, SEQ ID NO:2531, SEQ ID NO:2532, SEQ ID NO:2533, SEQ ID NO:2534, SEQ ID 
NO:2535, SEQ ID NO:2536, SEQ ID NO-.2537, SEQ ID NO:2538, SEQ ID NO:2539, SEQ ID 
NO.2540, SEQ ID NO.2541, SEQ ID NO:2542, SEQ ID NO:2543, SEQ ID NO:2544, SEQ ID 
NO:2545, SEQ ID NO:2546, SEQ ID NO:2547, SEQ ID NO:2548, SEQ ID NO:2549, SEQ ID 
NO:2550, SEQ ID NO:2551, SEQ ID NO:2552, SEQ ID NO:2553, SEQ ID NO:2554, SEQ ID 
NO:2555, SEQ ID NO:2556, SEQ ID NO:2557, SEQ ID NO:2558, SEQ ID NO:2559, SEQ ID 
NO:2560, SEQ ID NO:2561, SEQ ID NO:2562, SEQ ID NO:2563, SEQ ID NO:2564, SEQ ID 
NO:2565, SEQ ID NO:2566, SEQ ID NO:2567, SEQ ID NO:2568, SEQ ID NO:2569, SEQ ID 
NO:2570, SEQ ID NO:2571, SEQ ID NO:2572, SEQ ID NO:2573, SEQ ID NO:2574, SEQ ID 
NO:2575, SEQ ID NO:2576, SEQ ID NO:2577, SEQ ID NO-.2578, SEQ ID NO:2579, SEQ ID 
NO:2580, SEQ ID NO.2581, SEQ ID NO:2582, SEQ ID NO:2583, SEQ ID NO:2584, SEQ ID 
NO:2585, SEQ ID NO:2586, SEQ ID NO:2587, SEQ ID NO:2588, SEQ ID NO:2589, SEQ ID 
NO:2590, SEQ ID NO.-2591, SEQ ID NO:2592, SEQ ID NO:2593, SEQ ID NO:2594, SEQ ID 
NO:2595, SEQ ID NO:2596, SEQ ID NO:2597, SEQ ID NO-.2598, SEQ ID NO:2599, SEQ ID 
NO.2600, SEQ ID NO:2601, SEQ ID NO:2602, SEQ ID NO:2603, SEQ ID NO:2604, SEQ ID 
NO:2605, SEQ ID NO:2606, SEQ ID NO:2607, SEQ ID NO:2608, SEQ ID NO:2609, SEQ ID 
NO:2610, SEQ ID NO:261 1, SEQ ID NO:2612, SEQ ID NO.2613, SEQ ID NO:2614, SEQ ID 
NO:2615, SEQ ID NO:2616, SEQ IDNO:2617, SEQ ID NO:2618, SEQ ID NO:2619, SEQ ID 
NO:2620, SEQ ID NO:2621, SEQ ID NO:2622, SEQ ID NO:2623, SEQ ID NO:2624, SEQ ID 
NO:2625, SEQ ID NO:2626, SEQ ID NO:2925, SEQ ID NO:2926, SEQ ID NO:2927, SEQ ID 
NO:2928, SEQ ID NO:2929, SEQ ID NO:2930, SEQ ID NO:2932, SEQ ID NO:2933, SEQ ID 
NO:2935, SEQ ID NO:2936, SEQ ID NO:2937, SEQ ID NO:2938, SEQ ID NO.2939, SEQ ID 
NO:2941, SEQ ID NO:2942, SEQ ID NO:2943, SEQ ID NO:2945, SEQ ID NO.2946, SEQ ID 
NO:2947, SEQ ID NO:2948, SEQ ID NO:2949, SEQ ID NO:2950, SEQ ID NO:295 1, SEQ ID 
NO:2952, SEQ ID NO:2953, SEQ ID NO:2954, SEQ ID NO:2955, SEQ ID NO:2956, SEQ ID 
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NO:2957, SEQ ID NO:2959, SEQ ID NO:2960, SEQ ID NO.2961, SEQ ID NO:2962, SEQ ID 
NO:2963, SEQ ID NO:2964, SEQ ID NO.2965, SEQ ID NO:2966, SEQ ID NO:2967, SEQ ID 
NO:2968, SEQ ID NO:2969, SEQ ID NO:2970, SEQ ID NO:2971, SEQ ID NO:2972, SEQ ID 
NO:2973, SEQ ID NO:2974, SEQ ID NO:2975, SEQ ID NO:2976, SEQ ID NO:2977, SEQ ID 
NO.2978, SEQ ID NO:2979, SEQ ID NO:2980, SEQ ID NO:298l, SEQ ID NO:2982, SEQ ID 
NO:2983, SEQ ID NO:2984, SEQ ID NO:2985, SEQ ID NO:2986, SEQ ID NO:2987, SEQ ID 
NO:2988, SEQ ID NO:2989, SEQ ID NO:2990, SEQ ID NO:2991, SEQ ID NO:2992, SEQ ID 
NO:2993, SEQ ID NO:2994, SEQ ID NO:2995, SEQ ID NO:2996, SEQ ID NO:2997, SEQ ID 
NO:2998, SEQ ID NO:2999, SEQ ID NO:3000, SEQ ID NO:300l, SEQ ID NO:3002, SEQ ID 
NO:3003, SEQ ID NO:3004, SEQ ID NO:3005, SEQ ID NO:3006, SEQ ID NO:3007, SEQ ID 
NO:3008, SEQ ID NO:3009, SEQ ID NO:3010, SEQ ID NO:301 1, SEQ ID NO:3012, SEQ ID 
NO:3013, SEQ ID NO:3014, SEQ ID NO:3015. In another variation, the method includes detectin g 
the expression level of one or more additional genes by measuring one or more proteins expressed by 
the one or more additional genes. The one or more proteins expressed by the one or more genes 
comprises an amino acid sequence selected from SEQ ID NO:2400, SEQ ID NO:2401, SEQ ID 
NO:2402, SEQ ID NO:2403, SEQ ID NO:2404> SEQ ID NO:2405, SEQ ID NO:2406, SEQ ID 
NO:2407, SEQ ID NO:2408, SEQ ID NO:2409, SEQ ID NO:2410, SEQ ID NO:241 1, SEQ ID 
NO:2412, SEQ ID NO:2413, SEQ ID NO:2414, SEQ ID NO:2415, SEQ ID NO:2416, SEQ ID 
NO.2417, SEQ ID NO:2418, SEQ ID NO:2419, SEQ ID NO:2420, SEQ ID NO:2421, SEQ ID 
NO:2422, SEQ ID NO:2423, SEQ ID NO:2424, SEQ ID NO:2425, SEQ ID NO:2426, SEQ ID 
NO:2427, SEQ ID NO:2428, SEQ ID NO:2429, SEQ ID NO:2430, SEQ ID NO:2432, SEQ ID 
NO:2433, SEQ ID NO:2434, SEQ ID NO.2435, SEQ ID NO:2436, SEQ ID NO:2437, SEQ ID 
NO:2438, SEQ ID NO:2439, SEQ ID NO:2440, SEQ ID NO:2441, SEQ ID NO:2442, SEQ ID 
NO:2443, SEQ ID NO:2444, SEQ ID NO:2445, SEQ ID NO:2446, SEQ ID NO:2447, SEQ ID 
NO:2448, SEQ ID NO:2449, SEQ ID NO:2450, SEQ ID NO:2451, SEQ ID NO:2452, SEQ ID 
NO:2453, SEQ ID NO:2454, SEQ ID NO.2455, SEQ ID NO:2456, SEQ ID NO:2457, SEQ ID 
NO:2458, SEQ ID NO:2459, SEQ ID NO:2460, SEQ ID NO:2461 , SEQ ID NO:2462, SEQ ID 
NO:2463, SEQ ID NO:2464, SEQ ID NO:2465, SEQ ID NO:2466, SEQ ID NO:2467, SEQ ID 
NO:2468, SEQ ID NO:2469, SEQ IDNO:2470, SEQ ID NO:2474, SEQ ID NO:2478, SEQ ID 
NO:2479, SEQ ID NO:2480, SEQ ID NO-.2481, SEQ ID NO:2482, SEQ ID NO:2483, SEQ ID 
NO:2485, SEQ ID NO:2486, SEQ ID NO:2487, SEQ ID NO:2488, SEQ ID NO:2491, SEQ ID 
NO:2492, SEQ ID NO:2493, SEQ ID NO:2494, SEQ ID NO:2495, SEQ ID NO:2496, SEQ ID 
NO:2497, SEQ ID NO:2502, SEQ ID NO:2503, SEQ ID NO:2504, SEQ ID 1^0:2505, SEQ ID 
NO:2506, SEQ ID NO:2507, SEQ ID NO:2508, SEQ ID NO.2509, SEQ ID NO.2510, SEQ ID 
NO:251 1, SEQ ID NO:2512, SEQ ID NO:2513, SEQ ID NO:2514, SEQ ID NO:2515, SEQ ID 
NO:2516, SEQ ID NO:2517, SEQ ID NO:2518, SEQ ID NO:2519, SEQ ID NO:2520, SEQ ID 
NO:2521, SEQ ID NO:2528> SEQ ID NO:2529, SEQ ID NO:2530, SEQ ID NO:2531, SEQ ID 
NO:2532, SEQ ID NO:2533, SEQ ID NO:2534, SEQ ID NO:2535, SEQ ID NO:2536, SEQ ID 
NO-.2537, SEQ ID NO:2538, SEQ ID NO:2539, SEQ ID NO:2540, SEQ ID NO:2541, SEQ ID 
NO:2542, SEQ ID NO:2543, SEQ ID NO:2544, SEQ ID NO:2545, SEQ ID NO:2546, SEQ ID 
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NO:2547, SEQ ID NO:2548, SEQ ID NO:2549, SEQ ID NO:2550, SEQ ID NO:2551, SEQ ID 
NO:2552, SEQ ID NO:2553, SEQ ID NO:2554, SEQ ID NO.2555, SEQ ID NO:2556, SEQ ID 
NO:2557, SEQ ID NO:2558, SEQ ID NO:2559, SEQ ID NO:2560, SEQ ID NO:2561, SEQ ID 
NO:2562, SEQ ID NO:2563, SEQ ID NO:2564, SEQ ID NO:2565, SEQ ID NO:2566, SEQ ID 
NO:2567, SEQ ID N0.2568, SEQ ID NO:2569, SEQ ID NO:2570, SEQ ID NO:257l, SEQ ID 
NO:2572, SEQ ID NO:2573, SEQ ID NO:2574, SEQ ID NO:2575, SEQ ID NO:2576, SEQ ID 
NO.2577, SEQ ID NO.2578, SEQ ID NO:2579, SEQ ID NO:2580, SEQ ID NO:2581, SEQ ID 
N0.2582, SEQ ID NO:2583, SEQ ID NO.2584, SEQ ID NO:2585, SEQ ID NO:2586, SEQ ID 
NO:2587, SEQ ID NO:2588, SEQ ID NO:2589, SEQ ID NO:2590, SEQ ID NO:2591, SEQ ID 
NO:2592, SEQ ID NO:2593, SEQ ID NO.2594, SEQ ID NO:2595, SEQ ID NO:2596, SEQ ID 
NO:2597, SEQ ID NO:2598, SEQ ID NO:2599, SEQ ID NO:2600, SEQ ID NO:2601, SEQ ID 
NO:2602, SEQ ID NO:2603, SEQ ID NO:2604, SEQ ID NO:2605, SEQ ID NO:2606, SEQ ID 
NO:2607, SEQ ID NO:2608, SEQ ID NO:2609, SEQ ID NO:2610, SEQ ID NO:261 1, SEQ ID 
NO:2612, SEQ ID NO:2613, SEQ ID NO:2614, SEQ ID NO:2615, SEQ ID NO:2616, SEQ ID 
NO.2617, SEQ ID N0.2618, SEQ ID NO:2619, SEQ ID NO:2620, SEQ ID NO:2621, SEQ ID 
NO:2622, SEQ ID NO:2623, SEQ ID NO:2624, SEQ ID NO:2625, SEQ ID NO:2626, SEQ ID 
NO:2925, SEQ ID NO:2926, SEQ ID NO:2927, SEQ ID NO:2928, SEQ ID NO:2929, SEQ ID 
NO:2930, SEQ ID NO:2932, SEQ ID NO:2933, SEQ ID NO:2935, SEQ ID NO:2936, SEQ ID 
NO:2937, SEQ ID NO:2938, SEQ ID NO:2939, SEQ ID NO:2941, SEQ ID NO:2942, SEQ ID 
NO:2943, SEQ ID NO:2945, SEQ ID NO:2946, SEQ ID NO:2947, SEQ ID NO:2948, SEQ ID 
NO:2949, SEQ ID NO:2950, SEQ ID NO:2951, SEQ ID NO:2952, SEQ ID NO:2953, SEQ ID 
NO:2954, SEQ ID NO:2955, SEQ ID NO:2956, SEQ ID NO:2957, SEQ ID NO:2959, SEQ ID 
NO:2960, SEQ ID NO:2961, SEQ ID NO:2962, SEQ ID NO:2963, SEQ ID NO:2964, SEQ ID 
NO:2965, SEQ ID NO:2966, SEQ ID NO:2967, SEQ ID NO:2968, SEQ ID NO:2969, SEQ ID 
NO:2970, SEQ ID NO:2971, SEQ ID NO:2972, SEQ ID NO:2973, SEQ ID NO:2974, SEQ ID 
NO:2975, SEQ ID NO:2976, SEQ ID NO:2977, SEQ ID NO:2978, SEQ ID NO:2979, SEQ ID 
NO:2980, SEQ ID NO:2981, SEQ ID NO:2982, SEQ ID NO:2983, SEQ ID NO:2984, SEQ ID 
NO:2985, SEQ ID NO:2986, SEQ ID NO:2987, SEQ ID NO:2988, SEQ ID NO:2989, SEQ ID 
NO:2990, SEQ ID NO:2991, SEQ ID NO:2992, SEQ ID NO:2993, SEQ ID NO:2994, SEQ ED 
NO:2995, SEQ ID NO:2996, SEQ ID NO:2997, SEQ ID NO:2998, SEQ ID NO:2999, SEQ ID 
NO:3000, SEQ ID NO:3001, SEQ ID NO:3002, SEQ ID NO:3003, SEQ ID NO.3004, SEQ ID 
NO:3005, SEQ ID NO:3006, SEQ ED NO:3007, SEQ ID NO:3008, SEQ ID NO:3009, SEQ ID 
NO:3010, SEQ ID N0:301 1, SEQ ID NO:3012, SEQ ID NO:3013, SEQ ID NO:30I4, SEQ ID 
NO:3015, and the one or more proteins expressed by the one or more additional genes may include an 
amino acid sequence selected from SEQ ID NO:2431, SEQ ID NO:2471, SEQ ID NO:2472, SEQ ID 
NOI2473, SEQ ID NO:2475, SEQ ID NO:2476, SEQ ID NO:2477, SEQ ID NO:2484, SEQ ID 
NO:2489, SEQ ID NO.2490, SEQ ID NO:2498, SEQ ID NO:2499, SEQ ID NO:2500, SEQ ID 
NO:2501, SEQ ID NO:2522, SEQ ID NO:2523, SEQ ID NO.2524, SEQ ID NO:2525, SEQ ID 
NO:2526, SEQ ID NO:2527. 
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Protein detection may be accomplished by measuring serum. In another variation, the protein 
is a cell surface protein. In a further variation, the measuring includes using a fluorescent activated cell 
sorter. 

In another aspect, the invention is directed to a substantially purified oligonucleotide having 
the nucleotide sequence selected from SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, 
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO.8, SEQ ID NO:9, SEQ ID NO: 1 0, SEQ ID NO: 1 1 , SEQ ID 
NO: 1 2, SEQ ID NO: 1 3, SEQ ID NO: 1 4. SEQ ID NO: 15, SEQ ID NO: 1 6, SEQ ID NO: 1 7, SEQ ID 
NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID N0.28, SEQ ID NO:29, SEQ ID 
NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID 
NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID 
NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID 
NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:S3, SEQ ID 
NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID 
NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID 
NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID 
NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID 
NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:8 1 , SEQ ID NO:82, SEQ ID NO:83, SEQ ID 
NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ED 
NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID 
NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID 
NO.102, SEQ ID NO-.103, SEQ ID NO.104, SEQ ID NO:105, SEQ ID NO: 106, SEQ ID NO.107, SEQ 
ID NO:108, SEQ ID NO:109, SEQ ID NO:l 10, SEQ ID NO:l 1 1, SEQ ID NO:l 12, SEQ ID NO:l 13, 
SEQ ID NO: 1 1 4, SEQ ID NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, SEQ ID 
NO:l 19, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO: 123, SEQ ED NO-.124, SEQ 
ED NO: 1 25, SEQ ID NO: 1 26, SEQ ID NO: 1 27, SEQ ID NO: 1 28, SEQ ID NO: 1 29, SEQ ID NO: 130, 
SEQ ID NO.131, SEQ IDNO:132, SEQ ED NO:133, SEQ ID NO: 134, SEQ ID NO:135, SEQ ID 
NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ IDNO:141, SEQ 
ID NO: 1 42, SEQ ID NO: 1 43, SEQ ID NO: 1 44, SEQ ID NO: 1 45, SEQ ID NO: 1 46, SEQ ID NO: 147, 
SEQ ID NO:148, SEQ ED NO:149, SEQ ID NO.1S0, SEQ ED NO:151, SEQ ID NO:152, SEQ ID 
NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ 
ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, 
SEQ ID NO: 1 65, SEQ ID NO: 1 66, SEQ ID NO: 1 67, SEQ ID NO: 1 68, SEQ ID NO: 1 69, SEQ ID 
NO:170, SEQ EDNO:171, SEQ ID NO:172, SEQ IDNO.173, SEQ IDNO:174, SEQEDNO:175, SEQ 
ID NO: 1 76, SEQ ID NO: 1 77, SEQ ED NO: 1 78, SEQ ID NO. 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , 
SEQ ID NO: 1 82, SEQ ID NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID 
NO: 187, SEQ ID NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO. 192, SEQ 
ID NO: 193, SEQ ID NO: 194, SEQ ID NO. 195, SEQ ID NO: 196, SEQ ID NO: 197, SEQ ID NO: 198, 
SEQ ID NO: 199, SEQ ID NO:200, SEQlD NO:201,.SEQ ID NO:202, SEQ ID NO:203, SEQ ID 
NO:204, SEQ ED NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ 
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ID NO:210, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO: 2 13, SEQ ID NO:214, SEQ ID NO:215, 
SEQ ID NO:216, SEQ ID NO:2l7, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID 
NO:22 1, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ 
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:23 1 , SEQ ID NO:232, 
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID N0.237, SEQ ID 
NO:238, SEQ ID NO.239, SEQ ID NO:240, SEQ ID NO:241, SEQ ID NO:242, SEQ ID NO:243, SEQ 
ID NO:244 ; SEQ. ID NO:245, SEQJD NO:246, SEQ ID NO:247, SEQ ID NO:248, SEQ JD NO:249, 
SEQ ID NO:250, SEQ ID NO:25I, SEQ ID NO:252, SEQ ID NO:253, SEQ ID NO:254, SEQ ID 
NO:255, SEQ ID NO:256, SEQ ID NO:257, SEQ ID NO:258, SEQ ID NO:259, SEQ ED NO:260, SEQ 
ID NO:261, SEQ ID NO:262, SEQ ID NO:263, SEQ ID NO:264, SEQ ID NO:265, SEQ ID NO:266, 
SEQ ID NO:267, SEQ ID NO:268, SEQ ID NO:269, SEQ ID NO:270, SEQ ID NO:271, SEQ ID 
NO:272, SEQ ID NO:273, SEQ ID NO:274, SEQ ID NO:275, SEQ ID NO:276, SEQ ID NO:277, SEQ 
ID NO:278, SEQ ID NO:279, SEQ ID NO.280, SEQ ID NO: 281, SEQ ID NO:282, SEQ ID NO:283, 
SEQ ID NO:284, SEQ ID NO.285, SEQ ID NO:286, SEQ ID NO:287, SEQ ID NO:288, SEQ ID 
NO:289, SEQ ID NO.290, SEQ ID NO:291, SEQ ID NO:292, SEQ ID NO:293, SEQ ID NO:294, SEQ 
ID NO:295, SEQ ID NO:296, SEQ ID NO:297, SEQ ID NO:298, SEQ ID NO:299, SEQ ID NO:300, 
SEQ ID NO.301, SEQ ID NO:302, SEQ ID NO:303, SEQ ID NO:304, SEQ ID NO:305, SEQ ID 
NO:306, SEQ ID NO:307, SEQ ID NO.308, SEQ ID NO.309, SEQ ID NO:310, SEQ ID NO:31 1, SEQ 
ID NO:312, SEQ ID NO:313, SEQ ID NO:314, SEQ IDNO:315, SEQ ID NO:316, SEQ ID NO:317, 
SEQ ID NO:318, SEQ ID NO:319, SEQ ID NO:320, SEQ ID NO:321, SEQ ID NO:322, SEQ ID 
NO:323, SEQ ID NO:324, SEQ ID NO:325, SEQ ID NO:326, SEQ ID NO:327, SEQ ID NO:328, SEQ 
ID NO:329, SEQ ID NO:330, SEQ ID NO:331, SEQ ID NO:332, SEQ ID NO:2697, SEQ ID 
NO:2645, SEQ ID NO:2707, SEQ ID NO:2679, SEQ ID NO:271 7, SEQ ID NO:2646, SEQ ID 
NO:2667, SEQ ID NO:2706, SEQ ID NO:2740, SEQ ID NO:2669, SEQ ID NO:2674, SEQ ID 
NO:2743, SEQ ID NO:2716, SEQ ID NO:2727, SEQ ID NO:2721, SEQ ID NO:2641, SEQ ID 
NO:2671, SEQ ID NO:2752, SEQ ID NO.2737, SEQ ID NO:2719, SEQ ID NO:2684, SEQ ID 
NO:2677, SEQ ID NO:2748, SEQ ID NO:2703, SEQ ID NO:271 1, SEQ ID NO:2663, SEQ ID 
NO:2657, SEQ ID NO:2683, SEQ ID NO:2686, SEQ ID NO:2687, SEQ ID NO:2644, SEQ ID 
NO:2664, SEQ ID NO:2747, SEQ ID NO:2744, SEQ ID NO:2678, SEQ ID NO:2731, SEQ ID 
NO:2713, SEQ ID NO:2736, SEQ ID NO:2708, SEQ ID NO.2670, SEQ ID NO:2661, SEQ ID 
NO:2680, SEQ ID NO:2754, SEQ ID NO:2728, SEQ ID NO:2742, SEQ ID NO:2668, SEQ ID 
NO:2750, SEQ ID NO:2746, SEQ ID NO:2738, SEQ ID NO:2627, SEQ ID NO:2739, SEQ ID 
NO:2647, SEQ ID NO:2628, SEQ ID NO:2638, SEQ ID NO:2725, SEQ ID NO:27l4, SEQ ID 
NO:2635, SEQ ID NO:2751, SEQ ID NO:2629, SEQ ID NO:2695, SEQ ID NO:2741, SEQ ID 
NO:2691, SEQ ID NO:2726, SEQ ID NO:2722, SEQ ID NO:2689, SEQ ID NO:2734, SEQ ID 
NO:2631, SEQ IDNO:2656, SEQ ID NO:2696, SEQ ID NO:2676, SEQ ID NO:2701, SEQ ID 
NO:2730, SEQ ID NO:2710, SEQ ID NO:2632, SEQ ID NO:2724, SEQ ID NO:2698, SEQ ID 
NO:2662, SEQ ID NO:2753, SEQ ID NO.2704, SEQ ID NO:2675, SEQ ID NO:2700, SEQ ID 
NO;2640, SEQ ID NO:2723, SEQ ID NO:2658, SEQ ID NO:2688, SEQ ID NO;2735, SEQ ID 
NO:2702, SEQ ID NO:268I, SEQ ID NO:2755, SEQ ID NO:2715, SEQ ID NO:2732, SEQ ID 
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NO:2652, SEQ ID NO:2651, SEQ ID NO:2718, SEQ ID NO:2673, SEQ ID NO:2733, SEQ ID 
NO:27I2, SEQ ID NO:2659, SEQ ID NO:2654, SEQ ID NO:2636, SEQ ID NO:2639, SEQ ID 
NO:2690, SEQ ID NO:2705, SEQ ID NO:2685, SEQ ID NO:2692, SEQ ID NO:2693, SEQ ID 
NO:2648, SEQ ID NO:2650, SEQ ID NO:2720, SEQ ID NO:2660, SEQ JD NO:2666; SEQ ID 
NO:2699, SEQ ID NO:2633, SEQ ID NO:2672, SEQ ID NO:2642, SEQ ID NO:2682, SEQ ID 
NO:2655, SEQ ID NO:2630, SEQ ID NO:2745, SEQ ID NO:2643, SEQ ID NO:2694, SEQ ID 
NO:2749, SEQ ID NO:2665, SEQ ID NO:2649, SEQ ID NO:2637, SEQ ID NO:2634, SEQ ID 
NO:2709, SEQ ID NO:2653, SEQ ID NO:2729, a substantially purified oligonucleotide having the 
nucleotide sequence selected from SEQ ID NO:333-664, and substantially purified oligonucleotides 
having at least 90% sequence identity to an oligonucleotide having the nucleotide sequence selected 
from SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ 
ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, SEQ ID NO:12, SEQ ID NO:13, SEQ ID 
NO: 14, SEQ ID NO:15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 1 8, SEQ ID NO: 19, SEQ ID 
NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID 
NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID 
NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID 
NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID 
NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID 
NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID 
NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID 
NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID 
NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID 
NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID 
NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID 
NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID 
NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID 
NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ 
ID NO: 104, SEQ ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 107, SEQ ID NO: 108, SEQ ID NO: 109, 
SEQ ID NO: 1 1 0, SEQ ID NO: 1 1 1 , SEQ ID NO: 1 1 2, SEQ ID NO: 1 13, SEQ ID NO: 1 1 4, SEQ ID 
NO:l 15, SEQ ID NO:l 16, SEQ ID NO:l 17, SEQ ID NO:l 18, SEQ ID NO:l 19, SEQ ID NO: 120, SEQ 
ID NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, 
SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, SEQ ID 
NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ 
ID NO: 138, SEQ IDNO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO: 142, SEQ IDNO:143, 
SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, SEQ ID 
NO: 1 49, SEQ ID NO: 1 50, SEQ ID NO: 151, SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID NO: 1 54, SEQ 
ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, 
SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165, SEQ ID 
NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO: 170, SEQ ID NO:171, SEQ 
ID NO: 1 72, SEQ ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, 
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SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID 
NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 1 88, SEQ 
ID NO: 1 89, SEQ ID NO: 1 90, SEQ ID NO: 1 9 1 , SEQ ID NO: 1 92, SEQ ID NO: 1 93 , SEQ ID NO: 1 94, 
SEQ ID NO: 195, SEQ ID NO: 196, SEQ ID NO:i97, SEQ ID NO: 198, SEQ ID NO: 199, SEQ ID 
NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ 
ID NO.206, SEQ ID NO.207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, 
SEQJD NO:212, SEQ ID NO:213^SEQ ID NO:214, SEQ ID NO:2 15, SEQ ID NO:2 16, SEQ ID 
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ 
ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO.226, SEQ ID NO:227, SEQ ID NO:228, 
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID 
NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ 
ID NO:240, SEQ ID NO:241, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, 
SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO.249, SEQ ID NO:250, SEQ ID 
N0.251, SEQ ID NO:252, SEQ ID NO:253, SEQ ID NO:254, SEQ ID NO:255, SEQ ID NO:256, SEQ 
ID NO:257, SEQ ID NO:258, SEQ ID NO:259, SEQ ID NO:260, SEQ ID NO:261, SEQ ID NO:262, 
SEQ ID NO:263, SEQ ID NO:264, SEQ ID NO:265, SEQ ID NO:266, SEQ ID NO:267, SEQ ID 
N0.268, SEQ ID NO:269, SEQ ID NO:270, SEQ ID NO:271, SEQ ID NO:272, SEQ ID NO:273,*SEQ 
ID NO:274, SEQ ID NO:275, SEQ ID NO:276, SEQ ID NO:277, SEQ ID NO:278, SEQ ID NO:279, 
SEQ ID NO:280, SEQ ID NO:281, SEQ ID NO:282, SEQ ID NO:283, SEQ ID NO:284, SEQ ID 
NO:285, SEQ ID NO:286, SEQ ID NO:287, SEQ ID NO:288, SEQ ID NO:289, SEQ ID NO:290, SEQ 
ID NO:291, SEQ ID NO:292, SEQ ID NO:293, SEQ ID NO.294, SEQ ID NO:295, SEQ ID NO:296, 
SEQ ID NO:297, SEQ ID NO:298, SEQ ID NO:299, SEQ ED NO:300, SEQ ID NO:301, SEQ ID 
NO:302, SEQ ID NO:303, SEQ ID NO:304, SEQ ID NO:305, SEQ ID NO:306, SEQ ID NO:307, SEQ 
ID NO:308, SEQ ID NO:309, SEQ ID NO:310, SEQ ID NO:31 1, SEQ ID NO:312, SEQ ID NO:313, 
SEQ ID NO:314, SEQ ID NO:315, SEQ ID NO:316, SEQ ID NO:317, SEQ ID NO:318, SEQ ID 
NO:319, SEQ ID NO:320, SEQ ID NO:321, SEQ ID NO:322, SEQ ID NO:323, SEQ ID NO:324, SEQ 
IDNO:325, SEQ ID NO:326, SEQ ID NO:327, SEQ ID NO:328, SEQ ID NO:329, SEQ ID NO:330, 
SEQ ID NO:331, SEQ ID NO:332, SEQ ID NO:2697, SEQ ID NO: 2645, SEQ ID NO:2707, SEQ ID 
NO:2679, SEQ ID NO:2717, SEQ ID NO:2646, SEQ ID NO:2667, SEQ ED NO:2706, SEQ ID 
NO:2740, SEQ ID NO:2669, SEQ ID NO:2674, SEQ ID NO:2743, SEQ ID NO.2716, SEQ ID 
NO.2727, SEQ ID NO:2721, SEQ ID NO:2641, SEQ ID NO:2671, SEQ ID NO:2752, SEQ ID 
NO:2737, SEQ ID NO:2719, SEQ ID NO:2684, SEQ ID NO:2677, SEQ ID NO:2748, SEQ ID 
NO:2703, SEQ ID NO:271 1 , SEQ ID NO:2663, SEQ ID NO:2657, SEQ ID NO:2683, SEQ ID 
N0.2686, SEQ ID NO:2687, SEQ ID NO.2644, SEQ ID NO.2664, SEQ ID NO:2747, SEQ ID 
NO:2744, SEQ ID NO:2678, SEQ ID NO:2731, SEQ ID NO:2713, SEQ ID NO:2736, SEQ ID 
NO:2708, SEQ ID NO:2670, SEQ ID NO.2661, SEQ ID NO:2680, SEQ ID NO:2754, SEQ ID 
NO:2728, SEQ ID NO:2742, SEQ ID NO:2668, SEQ ID NO:2750, SEQ ID NO.2746, SEQ ID 
NO:2738, SEQ ID NO:2627, SEQ ID NO.2739, SEQ ID NO:2647, SEQ ID NO:2628, SEQ ID 
NO:2638,.SEQ ID NO:2725, SEQ ID NO:2714, SEQ II? NO:2635, SEQ ID NO:275 1, SEQ ID 
NO:2629, SEQ ID NO:2695, SEQ ID NO:2741, SEQ ID NO:2691, SEQ ID NO:2726, SEQ ID 
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NO:2722, SEQ ID NO:2689, SEQ ID NO:2734, SEQ ID NO:2631, SEQ ID NO:2656, SEQ ID 
NO:2696, SEQ ID NO:2676, SEQ ID NO:2701, SEQ ID NO:2730, SEQ ID NO:2710, SEQ ID 
NO:2632, SEQ ID NO:2724, SEQ ID NO:2698, SEQ ID NO:2662, SEQ ID NO:2753, SEQ ID 
NO:2704, SEQ ID NO:2675, SEQ ID NO:2700, SEQ ID NO:2640, SEQ ID N0.2723, SEQ ID 
NO:2658, SEQ ID NO.2688, SEQ ID NO.2735, SEQ ID NO.2702, SEQ ID NO:2681, SEQ ID 
NO:2755, SEQ ID NO:2715, SEQ ID NO:2732, SEQ ID NO:2652, SEQ ID NO:265 1, SEQ ID 
NO:2718, SEQ ID NO:2673, SEQ ID NO:2.733, SEQ ID NO:27 1 2, SEQ ID NO:2659, SEQ ID . 
NO:2654, SEQ ID NO:2636, SEQ ID NO:2639, SEQ ID NO:2690, SEQ ID NO:2705, SEQ ID 
NO:2685, SEQ ID NO:2692, SEQ ID NO:2693, SEQ ID NO:2648, SEQ ID NO:2650, SEQ ID 
NO:2720, SEQ ID NO:2660, SEQ ID NO:2666, SEQ ID NO:2699, SEQ ID NO:2633, SEQ ID 
NO:2672, SEQ ID NO:2642, SEQ ID NO:2682, SEQ ID NO:2655, SEQ ID NO:2630, SEQ ID 
NO:2745, SEQ ID NO:2643, SEQ ID NO:2694, SEQ ID NO:2749, SEQ ID NO:2665, SEQ ID 
NO:2649, SEQ ID NO:2637, SEQ ID NO.2634, SEQ ID NO.2709, SEQ ID NO:2653, SEQ ID 
NO.2729 and/or SEQ ID NO:333-664. In a further aspect, the invention is directed to a substantially 
purified oligonucleotide that hybridizes at high stringency to an oligonucleotide having the nucleotide 
sequence selected from SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, 
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO.9, SEQ ID NO: 10, SEQ ID NO.l 1, SEQ ID NO. 12, SEQ 
ID NO. 13, SEQ ID NO. 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID 
NO: 19, SEQ ID NO:20, SEQ ID NO:2 1 , SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID 
NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID 
NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID 
NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID 
NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID 
NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID 
NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID 
NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID 
NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID 
NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID 
NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO.84, SEQ ID 
NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID 
NO:91, SEQ IDNO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID 
NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID 
NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO.106, SEQ ID NO.107, SEQ ID NO.108, SEQ 
ID NO. 109, SEQ ID NO: 1 10, SEQ ID NO: 1 1 1 , SEQ ID NO: 1 12, SEQ ID NO: 1 13, SEQ ID NO:l 14, 
SEQ ID NO: 1 1 5, SEQ ID NO: 1 1 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, SEQ ID NO: 1 1 9, SEQ ID 
NO: 120, SEQ ID NO: 121, SEQ ID NO: 122, SEQ ID NO: 123, SEQ ID NO: 124, SEQ ID NO: 125, SEQ 
ID NO:126, SEQ ID NO.127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO-.131, 
SEQ ID N0.132, SEQ ID NO.133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID 
NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID NO: 141, SEQ ID NO: 142, SEQ 
ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, SEQ ID NO: 148, 
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SEQ ID NO: 149, SEQ ID NO. 1 50, SEQ ID NO. 151, SEQ ID NO: 1 52, SEQ ID NO: 1 53, SEQ ID 
NO. 1 54, SEQ ID NO: 1 55, SEQ ID NO. 1 56, SEQ ID NO: 1 57, SEQ ID NO: 1 58, SEQ ID NO: 1 59, SEQ 
ID NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, SEQ ID NO: 1 65, 
SEQ ID NO: 166, SEQ ID NO: 167, SEQ ID NO: 168, SEQ ID NO: 169; SEQ ID NO: 170, SEQ ID 
NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ IDNO:174; SEQ ID NO: 175, SEQ ID NO: 176, SEQ 
ID NO: 1 77, SEQ ID NO: 1 78, SEQ ID NO: 1 79, SEQ ID NO: 1 80, SEQ ID NO: 1 8 1 , SEQ ID NO: 1 82, 
SEQ ID NO:183, SEQ ID NQ:184, SEQ ID NO:l85, SEQ ID NO:186, SEQ ID NO:187, SEQ ID 
NO: 188, SEQ ID NO: 189, SEQ ID NO: 190, SEQ ID NO: 191, SEQ ID NO: 192, SEQ ID NO: 193, SEQ 
ID NO: 194, SEQ ID NO:195, SEQ ID NO.196, SEQ ID NO.197, SEQ ID NO:198, SEQIDNO.199, 
SEQ ID NO:200, SEQ ID NO:201 , SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO.204, SEQ ID 
NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:2 10, SEQ 
ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, 
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ 
ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO.232, SEQ ED NO:233, 
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ED 
NO:239, SEQ ID NO:240, SEQ ID NO:241, SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO-.244, SEQ 
ID NO:245, SEQ ID NO:246, SEQ ID NO:247, SEQ ED NO:248, SEQ ID NO:249, SEQ ID NO.-250, 
SEQ ID NO:251, SEQ ID NO:252, SEQ ID NO:253, SEQ ID NO:254, SEQ ID NO:255, SEQ ID 
NO:256, SEQ ID NO:257, SEQ ID NO:258, SEQ ID NO:259, SEQ ID NO:260, SEQ ID NO:261, SEQ 
ID NO:262, SEQ ID NO:263, SEQ ID NO:264, SEQ ID NO:265, SEQ ID NO-.266, SEQ ID NO.267, 
SEQ ID NO:268, SEQ ID NO:269, SEQ ID NO-.270, SEQ ID NO:271, SEQ ID NO:272, SEQ ID 
NO:273, SEQ IDNO:274, SEQ ID NO:275, SEQ ID NO:276, SEQ ID NO:277, SEQ ID NO.-278, SEQ 
ID NO:279, SEQ ED NO.-280, SEQ ID NO:281, SEQ ID NO:282, SEQ ID NO:283, SEQ ID NO:284, 
SEQ ID NO:285, SEQ ED NO:286, SEQ ID NO:287, SEQ ID NO:288, SEQ ID NO.289, SEQ ID 
NO.290, SEQ ID NO:291, SEQ ID NO:292, SEQ ID NO.293, SEQ ID NO:294, SEQ ID NO:295, SEQ 
ID NO:296, SEQ ID NO:297, SEQ ID NO:298, SEQ ID NO:299, SEQ ID NO:300, SEQ ID NO:301, 
SEQ ID NO:302, SEQ ID NO:303, SEQ ID NO:304, SEQ ID NO:305, SEQ ID NO.306, SEQ ID 
NO:307, SEQ ED NO:308, SEQ ID NO:309, SEQ ID NO:310, SEQ ID NO-.31 1, SEQ ID NO-.312, SEQ 
ID NO:313, SEQ ID NO:314, SEQ ED NO.315, SEQ ID NO.316, SEQ ID NO:317, SEQ ID NO:318, 
SEQ ID NO:319, SEQ ID NO:320, SEQ ID NO:321, SEQ ID NO:322, SEQ ID NO:323, SEQ ID 
NO:324, SEQ ID NO:325, SEQ ID NO:326, SEQ ID NO:327, SEQ ID NO:328, SEQ ID NO:329, SEQ 
ID NO:330, SEQ ID NO:331, SEQ ID NO:332, SEQ ID NO:2697, SEQ ID NO:2645, SEQ ID 
NO:2707, SEQ ID NO:2679, SEQ ID NO:2717, SEQ ID NO-.2646, SEQ ED NO:2667, SEQ ID 
NO.2706, SEQ ID NO:2740, SEQ ID NO:2669, SEQ ID NO:2674, SEQ ID NO:2743, SEQ ID 
NO:2716, SEQ ID NO:2727, SEQ ID NO:2721, SEQ ID NO:2641, SEQ ID NO:2671, SEQ ID 
NO:2752, SEQ ID NO:2737, SEQ ED NO:2719, SEQ ID NO:2684, SEQ ID NO:2677, SEQ ID 
NO:2748, SEQ ID NO:27()3, SEQ ID NO:271 1, SEQ lb NO.2663, SEQ ID NO:2657, SEQ ID 
NO-.2683, SEQ ID NO:2686, SEQ ID NO:2687, SEQ ID NO:2644, SEQ ID NO:2664, SEQ ID 
NO:2747, SEQ ID NO:2744, SEQ ID NO:2678, SEQ ID NO:2731, SEQ ID NO:2713, SEQ ID 
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NO:2736, SEQ ID NO:2708, SEQ ID NO.2670, SEQ ID NO-2661, SEQ ID NO:2680, SEQ ID 
NO.2754, SEQ ID NO.2728, SEQ ID NO:2742, SEQ ID NO:2668, SEQ ID NO:2750, SEQ ID 
NO:2746, SEQ ID NO.2738, SEQ ID NO:2627, SEQ ID NO:2739, SEQ ID NO:2647, SEQ ID 
NO:2628, SEQ ID NO:2638, SEQ ID NO:2725, SEQ ID NO:2714, SEQ ID NO:2635, SEQ ID 
NO:275)", SEQ ID NO:2629, SEQ ID NO:2695, SEQ ID NO-.2741, SEQ ID NO:2691, SEQ ID 
NO-2726, SEQ ID NO:2722, SEQ ID NO:2689, SEQ ID NO:2734, SEQ ID NO:263 1, SEQ ID 
NO:2656, SEQ ID NO:2696, SEQ.ID NO:2676,SEQ ID NO:270l,_SEQ ID NO:2730, SEQ ID 
NO:2710, SEQ ID NO:2632, SEQ ID NO:2724, SEQ ID NO:2698, SEQ ID NO:2662, SEQ ID 
NO:2753, SEQ ID NO:2704, SEQ ID NO.2675, SEQ ID NO.2700, SEQ ID NO.2640, SEQ ID 
NO:2723, SEQ ID NO:2658, SEQ ID NO:2688, SEQ ID NO.-2735, SEQ ID NO:2702, SEQ ID 
NO:2681, SEQ ID NO:2755, SEQ ID NO:2715, SEQ ID NO:2732, SEQ ID NO:2652, SEQ ID 
NO:2651, SEQ ID NO:27I8, SEQ ID NO:2673, SEQ ID NO:2733, SEQ ID NO:2712, SEQ ID 
NO:2659, SEQ ID NO:2654, SEQ ID NO:2636, SEQ ID NO:2639, SEQ ID NO.2690, SEQ ID 
NO.270S, SEQ ID NO:2685, SEQ ID NO:2692, SEQ ID NO:2693, SEQ ID NO:2648, SEQ ID 
NO:2650, SEQ ID NO:2720, SEQ ID NO:2660, SEQ ID NO:2666, SEQ ID NO:2699, SEQ ID 
NO:2633, SEQ ID NO:2672, SEQ ID NO:2642, SEQ ID NO:2682, SEQ ID NO:2655, SEQ ID 
NO:2630, SEQ ID NO:2745, SEQ ID NO:2643, SEQ ID NO.2694, SEQ ID NO:2749, SEQ ID 
NO:2665, SEQ ID NO.2649, SEQ ID NO:2637, SEQ ID NO:2634, SEQ ID NO:2709, SEQ ID 
NO:2653, SEQ ID NO:2729 or SEQ ED NOS:333-664. The sequences may be used as diagnostic 
oligonucleotides for transplant rejection and/or cardiac transplant rejection. The oligonucleotide may 
have nucleotide sequence including DNA, cDNA, PNA, genomic DNA, or synthetic oligonucleotides. 

In another aspect, the invention is directed to a method of diagnosing or monitoring transplant 
rejection in a patient wherein the expression level of one or more genes in a patient's bodily fluid is 
detected. In a further variation, the bodily fluid is peripheral blood. 

In another aspect, the invention is directed to a method of diagnosing or monitoring transplant 
rejection in a patient, comprising detecting the expression level of four or more genes in the patient to 
diagnose or monitor transplant rejection in the patient wherein the four or more genes include a 
nucleotide sequence selected from SEQ ID NO:2, SEQ ED NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ 
ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:l 1, SEQ ID 
NO:12, SEQ ID NO.13, SEQ ID NO.14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID 
NO: 18, SEQ ID NO: 19, SEQ ID NO.20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ED NO:28, SEQ ID NO:29, SEQ ED 
NO:30, SEQ ID NO:31, SEQ ED NO:32, SEQ ED NO:33, SEQ ED NO:34, SEQ ID NO.35, SEQ ID 
NO:36, SEQ ID NO:37, SEQ ID NO.38, SEQ ID NO:39, SEQ ID NO.40, SEQ ID NO:41, SEQ ED 
NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID 
NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO.52, SEQ ID NO:53, SEQ ID 
NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID 
NO:60i SEQ ID NO:6 1 , SEQ ID NO.62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID 
NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ED 
NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID 



41 



WO 2004/042346 



PCT/US2003/012946 



NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID 
NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID 
NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID 
NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ^ID NO: 100, SEQ ID NO: 101, SEQ ID 
NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO: 105, SEQ ID NO:106,SEQ ID NO:107, SEQ 
ID NO: 108, SEQ ID NO: 109, SEQ ID NO:l 10, SEQ ID NO: 1 1 1, SEQ ID NO: 1 12, SEQ ID NO:l 13, 
SEQ ID NO: 1 14, SEQ ID NO:J 1 5, SEQ ID NO: 11 6, SEQ ID NO: 1 1 7, SEQ ID NO: 1 1 8, SEQ ID 
NO: 1 1 9, SEQ ID NO: 1 20, SEQ ID NO: 1 2 1 , SEQ ID NO: 1 22, SEQ ID NO: 1 23, SEQ ID NO: 1 24, SEQ 
ID NO: 125, SEQ ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, 
SEQ ID NO: 131, SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID 
NO:136, SEQ ID NO:137, SEQ ID NO: 138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ 
ID NO: 142, SEQ ID NO: 143, SEQ ID NO: 144, SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 147, 
SEQ ID NO: 148, SEQ ID NO: 149, SEQ ID NO: 150, SEQ ID NO: 151, SEQ ID NO: 152, SEQ ID 
NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ 
ID NO: 1 59, SEQ ID NO: 1 60, SEQ ID NO: 1 6 1 , SEQ ID NO: 1 62, SEQ ID NO: 1 63, SEQ ID NO: 1 64, 
SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID 
NO: 170, SEQ ID NO: 171, SEQ ID NO: 172, SEQ ID NO: 173, SEQ ID NO: 174, SEQ ID NO: 175, SEQ 
ID NO: 176, SEQ ID NO: 177, SEQ ID NO: 178, SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, 
SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID NO: 184, SEQ ID NO: 185, SEQ ID NO: 186, SEQ ID 
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQIDNO:192, SEQ 
ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, 
SEQ ID NO: 199, SEQ ID NO:200, SEQ ID NO.201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID 
NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ 
ID NO:2 10, SEQ ID NO:21 1, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, 
SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID 
NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ 
ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, 
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, SEQ ID NO:242, SEQ ID NO:243, SEQ 
ID NO:244, SEQ ID NO:245, SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, 
SEQ ID NO:250, SEQ ID NO:251, SEQ ID NO:252, SEQ ID NO:253, SEQ ID NO:254, SEQ ID 
NO:255, SEQ ID NO:256, SEQ ID NO:257, SEQ ED NO:258, SEQ ID NO:259, SEQ ID NO:260, SEQ 
ID NO:261, SEQ ID NO:262, SEQ ID NO:263, SEQ ID NO:264, SEQ ID NO:265, SEQ ID NO:266, 
SEQ ID NO:267, SEQ ID NO:268, SEQ ID NO:269, SEQ ID NO:270, SEQ ID NO:271, SEQ ID 
NO:272, SEQ ID NO:273, SEQ ID NO:274, SEQ ID NO:275, SEQ ID NO:276, SEQ ID NO:277, SEQ 
ID NO:278, SEQ ID NO:279, SEQ ID NO:280, SEQ ID NO:281, SEQ ID NO:282, SEQ ID NO:283, 
SEQ ID NO:284, SEQ ID NO:285, SEQ ID NO:286, SEQ ID NO:287, SEQ ID NO:288, SEQ ID 
NO:289, SEQ ID NO:290, SEQ ID NO:291, SEQ ID NO:292, SEQ ID NO:293, SEQ ID NO:294, SEQ 
ID NO:295, SEQ ID NO:296, SEQ ID NO:297, SEQ ID NO:298, SEQ ID NO:299, SEQ ID NO:300, 
SEQ ID NO:301, SEQ ID NO:302, SEQ ID NO:303, SEQ ID NO:304, SEQ ID NO:305, SEQ ID 
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NO:306, SEQ ID NO:307, SEQ ID NO:308, SEQ ID NO:309, SEQ ID NO.310, SEQ ID NO:31 1, SEQ 
ID NO:3 1 2, SEQ ID NO:3 1 3, SEQ ID NO:3 14, SEQ ID NO:3 1 5, SEQ ID N0:3 1 6, SEQ ID NO:3 1 7, 
SEQ ID NO:3 1 8, SEQ ID NO:3 1 9, SEQ ID NO:320, SEQ ID NO:32 1 , SEQ ID NO:322, SEQ ID 
NO:323, SEQ ID NO:324, SEQ ID NO:325, SEQ ID NO:326; SEQ ID NO:327, SEQ ID NO:328, SEQ 
ID NO:329, SEQ ID NO:330, SEqTd NO:33l, SEQ ID tfb:332, SEQ ID NO:2697, SEQ ID 
NO.2645, SEQ ID NO:2707, SEQ ID NO:2679, SEQ ID NO:2717, SEQ ID NO:2646, SEQ ID 
NO:2667, SEQ IDJNO:2706, SEQ ID NO;2740, SEQ ID NO:2669, SEQ ID NO:2674, SEQ ID 
NO:2743, SEQ ID NO:2716, SEQ ID NO:2727, SEQ ID NO:2721, SEQ ID NO.2641, SEQ ID 
NO:2671, SEQ ID NO:2752, SEQ ID NO:2737, SEQ ID NO:2719, SEQ ID NO:2684, SEQ ID 
NO:2677, SEQ ID NO:2748, SEQ ID NO:2703, SEQ ID NO:271 1, SEQ ID NO:2663, SEQ ID 
NO:2657, SEQ ID NO:2683, SEQ ID NO:2686, SEQ ID NO:2687, SEQ ID NO:2644, SEQ ID 
NO:2664, SEQ ID NO:2747, SEQ ID NO:2744, SEQ ID NO:2678, SEQ ID NO:2731, SEQ ID 
NO:2713, SEQ ID NO:2736, SEQ ID NO:2708, SEQ ID NO:2670, SEQ ID NO:2661, SEQ ID 
NO:2680, SEQ ID NO:2754, SEQ ID NO:2728, SEQ ID NO:2742, SEQ ID NO:2668, SEQ ID 
NO.2750, SEQ ID NO:2746, SEQ ID NO:2738, SEQ ID NO:2627, SEQ ID NO:2739, SEQ ID 
NO:2647, SEQ ID NO:2628, SEQ ID NO:2638, SEQ ID NO:2725, SEQ ID NO:2714, SEQ ID 
NO:2635, SEQ ID NO:2751, SEQ ID NO:2629, SEQ ID NO:2695, SEQ ID NO:2741, SEQ ID 
NO:269l, SEQ ID NO:2726, SEQ ID NO.2722, SEQ ID NO:2689, SEQ ID NO:2734, SEQ ID 
NO:2631, SEQ ID NO:2656, SEQ ID NO:2696, SEQ ID NO:2676, SEQ ID NO:2701, SEQ ID 
NO.2730, SEQ ID NO:2710, SEQ ID NO:2632, SEQ ID NO:2724, SEQ ID NO:2698, SEQ ID 
NO:2662, SEQ ID NO:2753, SEQ ID NO.2704, SEQ ID NO:2675, SEQ ID NO:2700, SEQ ID 
NO:2640, SEQ ID NO:2723, SEQ ID NO:2658, SEQ ID NO:2688, SEQ ID NO:2735, SEQ ID 
NO:2702, SEQ ID NO:2681, SEQ ID NO:2755, SEQ ID NO:2715, SEQ ID NO:2732, SEQ ID 
NO:2652, SEQ ID NO:2651, SEQ ID NO:2718, SEQ ID NO:2673, SEQ ID NO:2733, SEQ ID 
NO:2712, SEQ ID NO:2659, SEQ ID NO:2654, SEQ ID NO:2636, SEQ ID NO:2639, SEQ ID 
NO:2690, SEQ ID NO:2705, SEQ ID NO:2685, SEQ ID NO:2692, SEQ ID NO:2693, SEQ ID 
NO.2648, SEQ ID NO:2650, SEQ ID NO:2720, SEQ ID NO:2660, SEQ ID NO:2666, SEQ ID 
NO:2699, SEQ ID NO:2633, SEQ ID NO:2672, SEQ ID NO:2642, SEQ ID NO:2682, SEQ ID 
NO:2655, SEQ ID NO:2630, SEQ ID NO:2745, SEQ ID NO.2643, SEQ ID NO.2694, SEQ ID 
NO:2749, SEQ ID NO:2665, SEQ ID NO:2649, SEQ ID NO:2637, SEQ ID NO:2634, SEQ ID 
NO:2709, SEQ ID NO:2653, SEQ ID NO.-2729. 

In another aspect, the invention is directed to a method of diagnosing or monitoring kidney 
transplant rejection in a patient by detecting one or more proteins in a bodily fluid of the patient to 
diagnose or monitor transplant rejection in the patient wherein the one or more proteins have a protein 
sequence selected from SEQ ID NO:76, SEQ ID NO:2663, SEQ ID NO:98, SEQ ID NO:2696, SEQ ID 
NO.2736, SEQ ID NO:2751, SEQ ID NO:2631, SEQ ID NO:2675, SEQ ID NO:2700, and SEQ ID 
NO:2693. 

In a further aspect, the invention is also directed to a system for detecting gene expression in 
body fluid including at least two isolated polynucleotides wherein the isolated polynucleotides detect 
expression of a gene wherein the gene includes a nucleotide sequence selected from SEQ ID NO:2, 
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SEQ ID NO:3, SEQ ID NO:4, SEQ ID N0.5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:9, SEQ ID 
NO: 1 0, SEQ ID NO: 1 1 , SEQ ID NO: 1 2, SEQ ID NO: 1 3, SEQ ID NO: 14, SEQ ID NO: 1 5, SEQ ID 
NO: 1 6, SEQ ID NO: 1 7, SEQ ID NO: 1 8, SEQ ID NO: 1 9, SEQ ID NO:20, SEQ ID NO:2 1 , SEQ ID 
NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID N0.27, SEQ ID 
NO:28, SEQ ID NO.29, SEQ ID NO:30, SEQ ID NO:3l', SEQ ID NO:32,"SEQ ID NO:33, SEQ ID 
N034, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID 
NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO.:43, SEQ ID NO:44, SEQ.ID.NO:45, SEQ ID 
NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID 
NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO.56, SEQ ID NO:57, SEQ ID 
NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID 
NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID 
NO:70, SEQ ID NO:71 . SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:82, SEQ ID 
NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO.88, SEQ ID 
NO:90, SEQ ID NO-.91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID 
NO:96, SEQ ID NO:98, SEQ ID NO: 101, SEQ ID NO:102, SEQ ID NO.103, SEQ ID NO:104, SEQ 
ID NO: 105, SEQ ID NO: 106, SEQ ID NO: 1 07, SEQ ID NO: 108, SEQ ID NO: 1 09, SEQ ID NO: 1 14, 
SEQ ID NO:l 15, SEQ ID NO:l 16, SEQ ID NO.l 17, SEQ ID NO:l 18, SEQ ID NO:l 19, SEQ ID 
NO-.120, SEQ ID NO-.121, SEQ ID NO:122, SEQ ID NO:123, SEQ IDNO:124, SEQ ID NO:125, SEQ 
ID NO: 126, SEQ ID NO: 127, SEQ ID NO: 128, SEQ ID NO: 129, SEQ ID NO: 130, SEQ ID NO: 131, 
SEQ ID NO: 132, SEQ ID NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID 
NO: 137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO. 152, SEQ ID NO: 153, SEQ ID NO:154, SEQ 
ID NO: 155, SEQ ID NO: 156, SEQ ID NO: 157, SEQ ID NO: 158, SEQ ID NO: 159, SEQ ID NO: 160, 
SEQ ID NO: 161, SEQ ID NO: 162, SEQ ID NO: 163, SEQ ID NO: 164, SEQ ID NO: 165, SEQ ID 
NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ EDNO:171, SEQ 
ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO-.176, SEQ ID NO.177, 
SEQ ID NO:178, SEQ IDNO:179, SEQIDNO:180, SEQ IDNO:181, SEQ ID NO:182, SEQ ID 
NO: 1 83, SEQ ID NO: 1 84, SEQ ID NO: 1 85, SEQ ID NO: 1 86, SEQ ID NO: 1 87, SEQ ID NO: 188, SEQ 
ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:192, SEQ ID NO:193, SEQ ID NO:194, 
SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO.198, SEQ ID NO.199, SEQ ID 
NO-.200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ 
ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, 
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ 
ID NO-.223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, 
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID 
NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ 
ID NO:240, SEQ ID NO:241,.SEQ ID NO:242, SEQ ID NO:243, SEQ ID NO:244, SEQ ID NO:245, 
SEQ ID NO:246, SEQ ID NO:247, SEQ ID NO-.248," SEQ ID NO.249, SEQ ID NO:250, SEQ ID 
NO:251, SEQ ID NO:252, SEQ ID NO:253, SEQ ID NO:254, SEQ ID NO:255, SEQ ID NO:256; SEQ 
ID NO:257, SEQ ID NO:258; SEQ ID NO:259, SEQ ID NO:260, SEQ ID NO:261, SEQ ID NO:262, 
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SEQ ID NO.263, SEQ ID NO.264, SEQ ID NO:265, SEQ ID NO:266, SEQ ID NO:267, SEQ ID 
NO:268, SEQ ID NO:269, SEQ ID NO:270, SEQ ID NO.271, SEQ ID NO:272, SEQ ID NO:273, SEQ 
ID NO:274, SEQ ID NO:275, SEQ ID NO:276, SEQ ID NO:277, SEQ ID NO:278, SEQ ID NO:279, 
SEQ ID NO:280, SEQ ID NO:281, SEQ ID NO:282, SEQ ID NO:283, SEQ ID NO:284, SEQ ID 
NO:285, SEQ ID NO:286, SEQ ID NO:287, SEQ ID NO:288, SEQ ID NO:289, SEQ ID NO:290, SEQ 
ID NO:291, SEQ ID NO:292, SEQ ID NO:293, SEQ ID NO:294, SEQ ID NO:295, SEQ ID NO:296, 
SEQ ID NO:297, SEQ ID NO:298, SEQ ID NO:299, SEQ ID NO:300, SEQ ID NO:301, SEQ ID 
NO:302, SEQ ID NO:303, SEQ ID NO:304, SEQ ID NO:305, SEQ ID NO.306, SEQ ID NO.307, SEQ 
ID NO-.308, SEQ ID NO.309, SEQ ID NO.310, SEQ ID NO:31 1, SEQ ID NO:312, SEQ ID NO:313, 
SEQ ID NO:314, SEQ ID NO:315, SEQ ID NO:3I6, SEQ ID NO:317, SEQ ID NO:318, SEQ ID 
NO:319, SEQ ID NO:320, SEQ ID NO:321, SEQ ID NO:322, SEQ ID NO:323, SEQ ID NO:324, SEQ 
ID NO:325, SEQ ID NO:326, SEQ ID NO:327, SEQ ID NO:328, SEQ ID NO:329, SEQ ID NO:330, 
SEQ ID NO:33 1 , SEQ ID NO:332, SEQ ID NO.2697, SEQ ID NO:2645, SEQ ID NO:2707, SEQ ID 
NO:2679, SEQ ID NO:2717, SEQ ID NO:2646, SEQ ID NO:2667, SEQ ID NO:2706, SEQ ID 
NO:2740, SEQ ID NO:2669, SEQ ID NO:2674, SEQ ID NO:2743, SEQ ID NO:2716, SEQ ID 
NO:2727, SEQ ID NO:2721, SEQ ID NO:2641, SEQ ID NO:2671, SEQ ID NO:2752, SEQ ID 
NO:2737, SEQ ID NO:2719, SEQ ID NO:2684, SEQ ID NO.2677, SEQ ID NO.2748, SEQ ID 
NO:2703, SEQ ID NO:271 1, SEQ ID NO:2663, SEQ ID NO:2657, SEQ ID NO:2683, SEQ ID 
NO:2686, SEQ ID NO:2687, SEQ ID NO:2644, SEQ ID NO:2664, SEQ ID NO:2747, SEQ ID 
NO:2744, SEQ ID NO:2678, SEQ ID NO:2731, SEQ ID NO:2713, SEQ ID NO:2736, SEQ ID 
NO:2708, SEQ ID NO.2670, SEQ ID NO:2661, SEQ ID NO.2680, SEQ ID NO:2754, SEQ ID 
NO:2728, SEQ ID NO:2742, SEQ ID NO:2668, SEQ ID NO:2750, SEQ ID NO:2746, SEQ ID 
NO:2738, SEQ ID NO:2627, SEQ ID NO:2739, SEQ ID NO:2647, SEQ ID NO:2628, SEQ ID 
NO:2638, SEQ ID NO:2725, SEQ ID NO:2714, SEQ ID NO:2635, SEQ ID NO:2751, SEQ ID 
NO:2629, SEQ ID N0.2695, SEQ ID NO:2741, SEQ ID NO.2691, SEQ ID NO:2726, SEQ ID 
NO:2722, SEQ ID NO:2689, SEQ ID NO:2734, SEQ ID NO:2631, SEQ ID NO:2656, SEQ ID 
NO:2696, SEQ ID NO:2676, SEQ ID NO:270I, SEQ ID NO:2730, SEQ ID NO:2710, SEQ ID 
NO:2632, SEQ ID NO:2724, SEQ ID NO:2698, SEQ ID NO.2662, SEQ ID NO:2753, SEQ ID 
NO:2704, SEQ ID NO:2675, SEQ ID NO.2700, SEQ ID NO:2640, SEQ ID NO:2723, SEQ ID 
NO:2658, SEQ ID NO:2688, SEQ ID NO:2735, SEQ ID NO:2702, SEQ ID NO:268 1 , SEQ ID 
NO:2755, SEQ ID NO:2715, SEQ ID NO:2732, SEQ ID NO:2652, SEQ ID NO:2651, SEQ ID 
NO:2718, SEQ ID NO:2673, SEQ ID NO:2733, SEQ ID NO:2712, SEQ ID NO:2659, SEQ ID 
NO:2654, SEQ ID NO:2636, SEQ ID NO:2639, SEQ ID NO.2690, SEQ ID NO.2705, SEQ ID 
NO:2685, SEQ ID NO:2692, SEQ ID NO:2693, SEQ ID NO:2648, SEQ ID NO:2650, SEQ ID 
NO:2720, SEQ ID NO:2660, SEQ ID NO:2666, SEQ ID NO:2699, SEQ ID NO:2633, SEQ ID 
NO:2672, SEQ ID NO:2642, SEQ ID NO:2682, SEQ ID NO:2655, SEQ ID NO:2630, SEQ ID 
NO:2745, SEQ ID NO:2643, SEQ ID NO:2694, SEQ ID NO:2749, SEQ ID NO:2665, SEQ ID 
NO-.2649, SEQ ID NO:2637, SEQ ID NO:2634, SEQ ID NO:2709, SEQ ID NO:2653, SEQ ID 
NO:2729 and the gene is differentially expressed in body fluid in an individual rejecting a transplanted 
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organ compared to the expression of the gene in leukocytes in an individual not rejecting a transplanted 
organ. 

In another aspect, the invention is directed to a system for detecting gene expression in body 
fluid including at least two isolated polynucleotides wherein the isolated polynucleotides detect 
expression of a gene wherein the gene includes a nucleotide sequence selected from SEQ ID NO:2, 
SEQ ID NO.3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID 
NO:9, SEQ ID NO: 10, SEQ ID NO: 1 1 V .SEQ ID NO: 1 2, SEQ ID NO: 13, SEQ ID NO: 1 4, SEQ ID 
NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID 
NO:2 1 , SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID 
NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID 
NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID 
NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID 
NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID 
NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID 
NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ ID 
NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID 
NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID 
NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID 
NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID 
NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID 
NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID 
NO:99, SEQ ID NO: 100, SEQ ID NO: 101, SEQ ID NO: 102, SEQ ID NO:103, SEQ ID NO: 104, SEQ 
ID NO:105, SEQ ID NO:106, SEQ IDNO:107, SEQ ID NO:108, SEQ ID NO:I09, SEQ ID NO:l 10, 
SEQ ID NO:l 1 1, SEQ ID NO:l 12, SEQ ID NO:l 13, SEQ ID NO:l 14, SEQ ID NO:l 15, SEQ ID 
NO. l 16, SEQ ID NO:l 17, SEQ ID NO:l 18, SEQ ID NO:l 19, SEQ ID NO: 120, SEQ ID NO: 121, SEQ 
ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ IDNO:126, SEQ ID NO:127, 
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID 
NO: 133, SEQ ID NO: 134, SEQ ID NO: 135, SEQ ID NO: 136, SEQ ID NO: 137, SEQ ID NO: 138, SEQ 
ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ IDNO:143, SEQ ID NO:144, 
SEQ ID NO:145, SEQ ID NO.146, SEQ ID NO:147, SEQ ID NO-.148, SEQ ID NO:149, SEQ ID 
NO: 150, SEQ IDNO:151, SEQ ID NO: 152, SEQ ID NO: 153, SEQ ID NO: 154, SEQ ID NO: 155, SEQ 
ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO.161, 
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID 
NO:167, SEQ ID NO.168, SEQ ID NO.169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ 
ID NO: 1 73, SEQ ID NO: 1 74, SEQ ID NO: 1 75, SEQ ID NO: 1 76, SEQ ID NO: 1 77, SEQ ID NO: 1 78, 
SEQ ID NO: 179, SEQ ID NO: 180, SEQ ID NO: 181, SEQ ID NO: 182, SEQ ID NO: 183, SEQ ID 
NO:184, SEQ ID NO:185, SEQ ID NO: 186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO.189, SEQ 
ID NO: 1 90, SEQ ID NO: 191 , SEQ ID N6: 1 92, SEQ ID NO. 1 93, SEQ ID NO: 194, SEQ ID NO: 1 95, 
SEQ ID NO:196^ SEQ ID NO: 197, SEQ ID NQ:198, SEQ ID NO: 199, SEQ ID NO:200, SEQ ID 
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ 
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ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:21 1, SEQ ID NO:2 12, 
SEQ ID NO:2 1 3, SEQ ID NO:2 1 4, SEQ ID NO:2 1 5, SEQ ID NO:2 1 6, SEQ ID NO:2 1 7, SEQ ID 
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ 
ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ~ID NO:228, SEQ ID NO:229, 
SEQ ID NO:230, SEQ IE) NO:23 1 , SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID 
NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ 
ID NO:241, SEQ ID NO:242, SEQ ID NO:243, JSEQ ID NO:244, SEQ ID NO:245, SEQ ID NO:246, 
SEQ ID NO:247, SEQ ID NO:248, SEQ ID NO:249, SEQ ID NO:250, SEQ ID NO:251, SEQ ID 
NO:252, SEQ ID NO:253, SEQ ID NO:254, SEQ ID NO:255, SEQ ID NO:256, SEQ ID NO:257, SEQ 
ID NO:258, SEQ ID NO:259, SEQ ID NO:260, SEQ ID NO:261, SEQ ID NO:262, SEQ ID NO:263, 
SEQ ID NO:264, SEQ ID NO:265, SEQ ID NO:266, SEQ ID NO:267, SEQ ID NO:268, SEQ ID 
NO:269, SEQ ID NO:270, SEQ ID NO:271, SEQ ID NO:272, SEQ ID NO:273, SEQ ID NO:274, SEQ 
ID NO:275, SEQ ID NO:276, SEQ ID NO:277, SEQ ID NO:278, SEQ ID NO:279, SEQ ID NO:280, 
SEQ ID NO:281, SEQ ID NO:282 > SEQ ID NO:283, SEQ ID NO:284, SEQ ID NO:285, SEQ ID 
NO:286, SEQ ID NO:287, SEQ ID NO:288, SEQ ID NO:289, SEQ ID NO:290, SEQ ID NO:291, SEQ 
ID NO:292, SEQ ID NO:293, SEQ ID NO:294, SEQ ID NO:295, SEQ ID NO:296, SEQ ID NO:297, 
SEQ ID NO:298, SEQ ID NO:299, SEQ ID NO:300, SEQ ID NO:301, SEQ ID NO:302, SEQ ID 
NO:303, SEQ ID NO:304, SEQ ED NO:305, SEQ ID NO:306, SEQ ID NO:307, SEQ ID NO:308, SEQ 
ID NO:309, SEQ ID NO:310, SEQ ID NO:31 1, SEQ ID NO:312, SEQ ID NO:313, SEQ ID NO:314, 
SEQ ID NO:315, SEQ ID NO:316, SEQ ID NO:317, SEQ ID NO:318, SEQ ID NO:319, SEQ ID 
NO:320, SEQ ID NO:321, SEQ ID NO:322, SEQ ID NO:323, SEQ ID NO:324, SEQ ID NO:325, SEQ 
ID NO:326, SEQ ID NO:327, SEQ ID NO:328, SEQ ID NO:329, SEQ ID NO:330, SEQ ID NO:331, 
SEQ ID NO:332, SEQ ID NO:2697, SEQ ID NO:2645, SEQ ID NO:2707, SEQ ID NO:2679, SEQ ID 
NO:2717, SEQ ID NO:2646, SEQ ID NO:2667, SEQ ID NO:2706, SEQ ID NO:2740, SEQ ID 
NO:2669, SEQ ID NO:2674, SEQ ID NO:2743, SEQ ID NO:2716, SEQ ID NO:2727, SEQ ID 
NO:2721, SEQ ID NO:2641, SEQ ID NO:2671, SEQ ID NO:2752, SEQ ID NO:2737, SEQ ID 
NO:2719, SEQ ID NO:2684, SEQ ID NO:2677, SEQ ID NO:2748, SEQ ID NO:2703, SEQ ID 
NO:271 1, SEQ ID NO:2663, SEQ ID NO:2657, SEQ ID NO:2683, SEQ ID NO:2686, SEQ ID 
NO:2687, SEQ ID NO:2644, SEQ ID NO:2664, SEQ ID NO:2747, SEQ ID NO:2744, SEQ ID 
NO.-2678, SEQ ID NO:2731, SEQ ID NO:2713, SEQ ID NO:2736, SEQ ID NO:2708, SEQ ID 
NO:2670, SEQ ID NO:2661, SEQ ID NO:2680, SEQ ID NO:2754, SEQ ID NO:2728, SEQ ID 
NO:2742, SEQ ID NO:2668, SEQ ID NO:2750, SEQ ID NO:2746, SEQ ID NO:2738, SEQ ID 
NO:2627, SEQ ID NO:2739, SEQ ID NO:2647, SEQ ID NO:2628, SEQ ID NO:2638, SEQ ID 
NO:2725, SEQ ID NO:2714, SEQ ID NO:2635, SEQ ID NO:2751, SEQ ID NO:2629, SEQ ID 
NO:2695, SEQ ID NO:2741, SEQ ID NO:2691, SEQ ID NO:2726, SEQ ID NO:2722, SEQ ID 
NO:2689, SEQ ID NO:2734, SEQ ID NO:2631, SEQ ID NO:2656, SEQ ID NO:2696, SEQ ID 
NO:2676, SEQ ID NO:2701, SEQ ID NO:2730, SEQ ID NO:2710, SEQ ID NO:2632, SEQ ID 
NO:2724, SEQ ID NO: 2 698, SEQ ID NO:2662, SEQ ID NO:2753, SEQ ID NO:2704, SEQ ID 
NO:2675, SEQ ID NO:2700, SEQ ID NO 2640, SEQ ID.NO:2723, SEQ ID NO:2658, SEQ ID 
NO:2688, SEQ ID NO:2735> SEQ ID NO:2702, SEQ ID NO:2681, SEQ ID NO:2755, SEQ ID 
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NO:2715, SEQ ID NO:2732, SEQ ID NO:2652, SEQ ID NO:2651, SEQ ID NO.2718, SEQ ID 
NO.2673, SEQ ID NO:2733, SEQ ID NO:2712, SEQ ID NO:2659, SEQ ID NO:2654, SEQ ID 
NO:2636, SEQ ID NO:2639, SEQ ID NO:2690, SEQ ID NO:2705, SEQ ID NO:2685, SEQ ID 
NO-2692, SEQ ID NO:2693, SEQ ID NO:2648, SEQ ID NO:2650, SEQ ID NO:2720, SEQ ID 
NO':2660, SEQ ID NO:2666, SEQ lb NO:2699, SEQ ID NO:2633, SEQ ID NO:2672, SEQ ID 
NO.2642, SEQ ID NO: 2682, SEQ ID NO:2655, SEQ ID NO:2630, SEQ ID NO:2745, SEQ ID 
NO:2643, SEQ ID NO:2694, SEQ ID NO:2749, SEQ ID NO:2665, SEQ ID NO:2649, SEQ ID 
NO:2637, SEQ ID NO:2634, SEQ ID NO:2709, SEQ ID NO.2653, SEQ ID NO:2729 and the gene 
expression is related to the rate of hematopoiesis or the distribution of hematopoeitic cells along their 
maturation pathway. 

The invention is also directed to methods of diagnosing or monitoring transplant rejection in a 
patient by detecting the expression level of one or more genes including a nucleotide sequence selected 
from SEQ ID NOS: 3016-3 1 17. SEQ ID NOS:3 108-3 1 17 are useful in detecting CMV infection. 

Brief Description of the Sequence Listing 

SEQ ID*s 1-332 are 50mer oligonucleotides corresponding to gene expression markers for diagnosis 
and monitoring of allograft rejection and other disorders. 

SEQ ID'S 333-664 are Reference mRNA sequences for genes identified by probes 1-332. 

SEQ ID'S 665-995 are a first set of Left PCR primers for genes 1-332. 

SEQ ID's 996-1326 are a first set of Right PCR primers for genes 1-332. 

SEQ ID'S 1327-1657 are Taqman probes for the first set PCR primers for genes 1-332. 

SEQ ID's 1658-1903 are a second alternative set of left PCR primers for selected genes 1-332 

SEQ ID's 1904-2151 are a second alternative set of right PCR primers for selected genes 1-332 

SEQ ID's 2152-2399 are Taqman probes for the second alternative set of PCR primers for selected 

genes 1-332. 

SEQ ID's 2400-2626 are Proteins encoded by mRNA's from genes identified in 1-332. 

SEQ ID's 2627-2795 are 50mer oligonucleotide array probes used to identify genes in Figure 7 and 

Tables 6 and 8. 

SEQ ID's 2796-2924 are reference mRNA sequences for genes in Table 8 which show altered 
expression in renal transplantation and rejection. 

SEQ ID's 2925-3015 are proteins coded by genes which show altered expression in Table 8. 
SEQ ID's 3016-3081 are 50mer oligonucleotide array probes and used to identify genes in the 
Examples. 

SEQ ID's 3082-3107 are genes and primers discussed in the Examples. 

SEQ ID's 3108-31 17 are mRNAs from human genes in which regulation is altered upon CMV 
infection. 

Brief Description of the Figures 

Figure 1: Figure 1 is a schematic flow chart illustrating a schematic instruction set for characterization 
of the nucleotide sequence and/or the predicted protein sequence of novel nucleotide sequences. 
Figure 2: Figure 2 depicts the components of an automated RNA preparation machine. 
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Figure 3 shows the results of six hybridizations on a mini array graphed (n=6 for each column). The 
error bars are the SEM. This experiment shows that the average signal from AP prepared RNA is 47% 
of the average signal from GS prepared RNA for both Cy3 and Cy5. 

Figure 4 shows the average background subtracted signal for each of nine leukocyte-specific genes on 
a mini array. This average is for 3-6 of the above-described hybridizations for each gene. The error 
bars are the SEM. 

Figure .5 shows the ratio of Cy3 to Cy5 signal for a number of genes. After normalization, this ratio 
corrects for variability among hybridizations and allows comparison between experiments done at 
different times. The ratio is calculated as the Cy3 background subtracted signal divided by the Cy5 
background subtracted signal. Each bar is the average for 3-6 hybridizations. The error bars are SEM. 
Figure 6 shows data median Cy3 background subtracted signals for control RNAs using mini arrays. 
Figure 7: Cardiac Allograft rejection diagnostic genes. 

A. Example of rejection and no-rejection samples expression data for 5 marker genes. For each 
sample, the associated rejection grades are shown as are the expression ratios for 5 differentially 
expressed genes. The genes are identified by the SEQ ID number for the oligonucleotide. The average 
fold difference between grade 0 and grade 3A samples is calculated at the bottom. 

B. CART classification model. Decision tree for a 3 gene classification model for diagnosis of 
cardiac rejection. In the first step, expression of gene 223 is used to divide the patients to 2 branches. 
The remaining samples in each branch are then further divided by one remaining gene. The samples 
are classified as either rejection or no rejection. 1 no rejection sample is misclassified as a rejection 
sample. 

C. Surrogates for the CART classification model. For each of the 3 splitter genes in the CART 
rejection model described in the example, 5 top surrogate genes are listed that were identified by the 
CART algorithm. 

Figure 8: Validation of differential expression of a gene discovered using microarrays using real-time 
PCR 

Figure 8A. The Ct for each patient sample on multiple assays is shown along with the Ct in the R50 
control RNA. Triangles represent ~RT (reverse transcriptase) controls. 

Figure 8B. The fold difference between the expression of Granzyme B and an Actin reference is 
shown for 3 samples from patients with and without CMV disease. 
Figure 9: Endpoint testing of PCR primers 

Electrophoresis and microfluidics are used to assess the product of gene specific PCR primers. 
P-GUS gel image. Lane 3 is the image for primers F178 and R242. Lanes 2 and 1 correspond to the 
no-template control and -RT control, respectively. 

The electropherogram of p-GUS primers F178 and R242, a graphical representation of Lane 3 from the 
gel image. 

3-Actin gel image. Lane 3 is the image for primers F75 and R178. Lanes 2 and 1 correspond to the 
no-template control and -RT control, respectively. 

The. electropherogram of P-Actin primers F75 and R178, a graphical representation of Lave 3 from the 
gel image. 
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Figure 10: PCR Primer efficiency testing. A standard curve of Ct versus log of the starting RNA 

amount is shown for 2 genes. 

Figure 11: Real-time PCR control gene analysis 

1 1 candidate control genes were tested using real-time PCR on 6 whole blood samples (PAX) paired 

with 6 mononuclear samples (CPT) from the same patient. Each sample was tested twice. For each 

gene, the variability of the gene across the samples is shown on the vertical axis (top graph). The 

average Ct value for each gene is also shown (bottom graph). 2ug RNA was used for PAX samples 

and 0.5 ug total RNA was used for the mononuclear samples (CPT). 

Figure 12: Rejection marker discovery by co-expression with established marker 

Microarrays were used to measure expression of genes SEQ ID 85 and 302 in samples derived from 

240 transplant recipients. For each sample, the expression measurement for 85 is plotted against 302. 

Figure 13: ROC (receiver operator characteristics) curve for a 3-gene PCR assay for diagnosis of 

rejection (see example 17). The Sensitivity and False Positive Rate for each test cutoff is shown. 

Brief Description of the Tables 

Table 1: Table 1 lists diseases or conditions amenable to study by leukocyte profiling. 
Table 2: Transplant Markers 

A. Transplant Genes: Genes useful for monitoring of allograft rejection are listed in this here. The 
gene symbol and name are given. SEQ ID 50mer is the sequence ID of a 50mer oligonucleotide 
that is specific for the gene. The NCBI Unigene number (HS) from (Build 160, 16 Feb 2003) is given 
as is an accession number (ACC) from (Genbank Release 135, 15 April 2003) for an RNA or cDNA is 
Genbank that corresponds to the gene. The sequence identified by the ACC number is in the sequence 
listing (SEQ ID RNA/cDNA). 

B. Microarray Data: SEQ ID 50mer, Gene, Gene Name, ACC and SEQ ID RNA/cDNA are given for 
each gene as in A (above). Each identified gene has a Non-Parametric Score and Median Rank in NR 
given from the non-parametric analysis of the data. The genes are ranked from highest to lowest 
scoring. Down Regulated genes are noted with a 1 in this column. 

C. PCR Primers: Primers and probes for real-time PCR assays for each gene are given along with their 
SEQ ID #s. Each gene has 1 or 2 sets of a forward and reverse PCR primer and a hybridization probe 
for detection in TaqMan or similar assays. 

D. PCR Data: Real-time PCR data was generated on a set of transplant samples using sybr green 
technology as described in the text. For each gene the number of samples (n) used in the analysis is 
given. An odds ratio and the p-values for a Fisher test and t-test are given for the comparison of acute 
rejection samples is given (see text). 

E. Transplant proteins: For each gene, the corresponding protein in the RefSeq data base (Genbank 
Release 135, 18 April 2003) is given (RefSeq Peptide Accession #) along the the SEQ ID for that 
protein for the sequence listing. 

Table 3: Viral gene for arrays. Viral genomes were used to design oligonucleotides for the 
microarrays. The accession numbers for the viral genomes used are given, along with the gene name 
and location of the region used for oligonucleotide design. 
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Table 4. Dependent variables for discovery of gene expression markers of cardiac allograft rejection. 
A stable Grade 0 is a Grade 0 biopsy in a patient who does not experience rejection with the subsequent 
biopsy. HG or highest grade means that the higher of the biopsy grades from the centralized and local 
pathologists was used for a definition of the dependent variable. 

Table 5: Real-time PCR assay reporter and quencher dyes. Various combinations of reporter and 
quencher dyes are useful for real-time PCR assays. Reporter and quencher dyes work optimally in 
specific combinations defined by their spectra. For each reporter, appropriate choices for quencher 
dyes are given. 

Table 6: Rejection marker PCR assay results 

Results of real-time PCR assays are listed for the comparison of rejection samples to no rejection 
samples. The fold change is given for expression of each gene in rejection/no rejection samples. The 
p-value for the t-test comparing the rejection and no rejection classes is given. 
Table 7: Summary results of array rejection significance analysis. Summary results are given for 
correlation analysis of leukocyte gene expression to acute rejection using significance analysis for 
microarrays (SAM). Five analyses are described. The ISHLT grades used to define the rejection and 
no rejection classes are given. In each case the highest grade from three pathology reading was taken 
for analysis. All samples are used for two analyses. The other analyses reduce redundancy of patients 
used in the analysis by using only one sample per patient ("Non-redundant") or using only one sample 
per patient within a given class ("Non-redundant within class"). The number of samples used in the 
analysis is given and the lowest false detection rate (FDR) achieved is noted. 
Table 8: Renal tissue rejection array significance analysis. Genes are listed that were identified as 
upregulated using microarrays on renal tissue with acute rejection versus controls. Significance 
analysis for microarrays (SAM) was used to determine the false detection rate for each gene (FDR). 
Genes with known expression in leukocytes are noted in the table. 

Table 9: Rejection marker sequence analysis. For 63 of the allograft rejection markers listed in Table 
2, an analysis of the gene sequence was done. The genes and proteins are identified by accession 
numbers. The cellular localization of each gene is described as either secreted, nuclear, mitochondrial, 
cytoplasmic or cellular membrane. The function of the gene is also described. 

Table 10: Gene expression markers for immature cells of a variety of lineages are given in Table 10 by 
way of example 

Table 11: Changes in the rate of hematopoiesis have been correlated to a number of disease states and 
other pathologies. Examples of such conditions are listed in Table 1 1 . 

Table 12: This table lists the oligonucleotides and associated genes identified as having value for the 
diagnosis and monitoring of CMV infection. The first column gives the SEQ ID that corresponds to the 
oligonuclotide in the sequence listing. The unigene number, genebank accession and GI number are 
also given for each sequence when known. The name of the gene associated with the accession number 
is noted. The strand is noted as -1 or 1, meaning that the probe was designed from the complement of 
the sequence (-1) or directly from the sequence (1). Next, the nucleotide sequence of each probe is also 
given. For each gene, the false detection rate (FDR) from the significance analsysis described in 
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example 7 is given if applicable. WBC is the white blood cell count. WPT is the number of weeks 
past transplant. 

Detailed Description of the Invention 
Definitions 

Unless defined otherwise, all scientific and technical terms are understood to have the same 
meaning as commonly used in the art to which they pertain. For the purpose of the present invention, 
the folio wing terms are defined below. 

In the context of the invention, the term "gene expression system" refers to any system, device 
or means to detect gene expression and includes diagnostic agents, candidate libraries, oligonucleotide 
sets or probe sets. 

The term "monitoring" is used herein to describe the use of gene sets to provide useful 
information about an individual or an individual's health or disease status. "Monitoring" can include, 
determination of prognosis, risk-stratification, selection of drug therapy, assessment of ongoing drug 
therapy, prediction of outcomes, deterrnining response to therapy, diagnosis of a disease or disease 
complication, following progression of a disease or providing any information relating to a patients 
health status over time, selecting patients most likely to benefit from experimental therapies with 
known molecular mechanisms of action, selecting patients most likely to benefit from approved drugs 
with known molecular mechanisms where that mechanism may be important in a small subset of a 
disease for which the medication may not have a label, screening a patient population to help decide on 
a more invasive/expensive test, for example a cascade of tests from a non-invasive blood test to a more 
invasive option such as biopsy, or testing to assess side effects of drugs used to treat another 
indication.. 

The term "diagnostic oligonucleotide set" generally refers to a set of two or more 
oligonucleotides that, when evaluated for differential expression of their products, collectively yields 
predictive data. Such predictive data typically relates to diagnosis, prognosis, monitoring of 
therapeutic outcomes, and the like. In general, the components of a diagnostic oligonucleotide set are 
distinguished from nucleotide sequences that are evaluated by analysis of the DNA to directly 
determine the genotype of an individual as it correlates with a specified trait or phenotype, such as a 
disease, in that it is the pattern of expression of the components of the diagnostic nucleotide set, rather 
than mutation or polymorphism of the DNA sequence that provides predictive value. It will be 
understood that a particular component (or member) of a diagnostic nucleotide set can, in some cases, 
also present one or more mutations, or polymorphisms that are amenable to direct genotyping by any of 
a variety of well known analysis methods, e.g., Southern blotting, RFLP, AFLP, SSCP, SNP, and the 
like. 

A "disease specific target oligonucleotide sequence" is a gene or other oligonucleotide that 
. encodes a polypeptide, most typically a protein, or a subunit of a multi-subunit protein, that is a 
therapeutic target for a disease, or group of diseases. 

A "candidate library" or a "candidate oligonucleotide library" refers to a collection of 
oligonucleotide sequences (or gene sequences) that by one or more criteria have an increased 
probability of being associated with a particular disease or group of diseases. The criteria can be, for 
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example, a differential expression pattern in a disease state or in activated or resting leukocytes in vitro 
as reported in the scientific or technical literature, tissue specific expression as reported in a sequence 
database, differential expression in a tissue or cell type of interest, or the like. Typically, a candidate 
library has at least 2 members or components; more typically, the library has in excess of about 10, or 
about 100, or about 1000, or even more, members or components. 

The term "disease criterion" is used herein to designate an indicator of a disease, such as a 
diagnostic factor, a prognostic factor, a factor indicated by a medical or family history, a genetic factor, 
or a symptom, as well as an overt or confirmed diagnosis of a disease associated with several indicators 
such as those selected from the above list. A disease criterian includes data describing a patient's 
health status, including retrospective or prospective health data, e.g. in the form of the patient's medical 
history, laboratory test results, diagnostic test result, clinical events, medications, lists, response(s) to 
treatment and risk factors, etc. 

The terms "molecular signature" or "expression profile" refers to the collection of expression 
values for a plurality (e.g., at least 2, but frequently about 10, about 100, about 1000, or more) of 
members of a candidate library. In many cases, the molecular signature represents the expression 
pattern for all of the nucleotide sequences in a library or array of candidate or diagnostic nucleotide 
sequences or genes. Alternatively, the molecular signature represents the expression pattern for one or 
more subsets of the candidate library. The term "oligonucleotide" refers to two or more nucleotides. 
Nucleotides maybe DNA or RNA, naturally occurring or synthetic. 

The term "healthy individual," as used herein, is relative to a specified disease or disease 
criterion. That is, the individual does not exhibit the specified disease criterion or is not diagnosed with 
the specified disease. It will be understood, that the individual in question, can, of course, exhibit 
symptoms, or possess various indicator factors for another disease. 

Similarly, an "individual diagnosed with a disease" refers to an individual diagnosed with a 
specified disease (or disease criterion). Such an individual may, or may not, also exhibit a disease 
criterion associated with, or be diagnosed with another (related or unrelated) disease. 

An "array" is a spatially or logically organized collection, e.g., of oligonucleotide sequences 
or nucleotide sequence products such as RNA or proteins encoded by an oligonucleotide sequence. In 
some embodiments, an array includes antibodies or other binding reagents specific for products of a 
candidate library. 

When referring to a pattern of expression, a "qualitative" difference in gene expression refers 
to a difference that is not assigned a relative value. That is, such a difference is designated by an "all or 
nothing" valuation. Such an all or nothing variation can be, for example, expression above or below a 
threshold of detection (an on/off pattern of expression). Alternatively, a qualitative difference can refer 
to expression of different types of expression products, e.g., different alleles (e.g., a mutant or 
polymorphic allele), variants (including sequence variants as well as post-translationally modified 
variants), etc. 

In contrast, a "quantitative" difference, when referring to a pattern of gene expression, refers 
to a difference in expression that can be assigned a value bri a graduated scale, (e.g., a 0-5 or 1-10 
scale, a + - +++ scale, a grade 1- grade 5 scale, or the like; it will be understood that the numbers 
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selected for illustration are entirely arbitrary and in no-way are meant to be interpreted to limit the 
invention). 

Gene Expression Systems of the Invention 

The invention is directed to a gene expression system having one or more DNA molecules 
wherein the one or more DNA molecules has a nucleotide sequence which detects expression of a gene 
corresponding to the oligonucleotides depicted in the Sequence Listing. In one format, the 
oligonucleotide detects expression of a gene that is differentially expressed in leukocytes. The gene 
expression system may be a candidate library, a diagnostic agent, a diagnostic oligonucleotide set or a 
diagnostic probe set. The DNA molecules may be genomic DNA, protein nucleic acid (PNA), cDNA 
or synthetic oligonucleotides. Following the procedures taught herein, one can identity sequences of 
interest for analyzing gene expression in leukocytes. Such sequences may be predictive of a disease 
state. 

Diagnostic oligonucleotides of the invention 
The invention relates to diagnostic nucleotide set(s) comprising members of the leukocyte 
candidate library listed in Table 2, Table 8, and in the Sequence Listing, for which a correlation exists 
between the health status of an individual, the individual's expression of RNA or protein products 
corresponding to the nucleotide sequence, and the diagnosis and prognosis of transplant rejection. In 
some instances, only one oligonucleotide is necessary for such detection. Members of a diagnostic 
oligonucleotide set may be identified by any means capable of detecting expression of RNA or protein 
products, including but not limited to differential expression screening, PCR, RT-PCR, SAGE analysis, 
high-throughput sequencing, microarrays, liquid or other arrays, protein-based methods (e.g., western 
blotting, proteomics, and other methods described herein), and data mining methods, as further 
described herein. 

In one embodiment, a diagnostic oligonucleotide set comprises at least two oligonucleotide 
sequences listed in Table 2, Table 8, or the Sequence Listing which are differentially expressed in 
leukocytes in an individual with at least one disease criterion for at least one leukocyte-implicated 
disease relative to the expression in individual without the at least one disease criterion, wherein 
expression of the two or more nucleotide sequences is correlated with at least one disease criterion, as 
described below. 

In another embodiment, a diagnostic nucleotide set comprises at least one oligonucleotide 
having an oligonucleotide sequence listed in Table 2, Table 8, or the Sequence Listing which is 
differentially expressed, and further wherein the differential expression/correlation has not previously 
been described. In some embodiments, the diagnostic nucleotide set is immobilized on an array. 

In another embodiment, diagnostic nucleotides (or nucleotide sets) are related to the members 
of the leukocyte candidate library listed in Table 2, Table 8, or in the Sequence Listing, for which a 
correlation exists between the health status, diagnosis and prognosis of transplant rejection (or disease 
criterion) of an individual. The diagnostic nucleotides are partially or totally contained in (or derived 
from) full-length gene sequences (or predicted full-length gene sequences) for the members of the 
candidate library, listed in Table 2, Table 8, and the sequence listing. In some cases, oligonucleotide 
sequences are designed from EST or Chromosomal sequences from a public database. In these cases 



54 



WO 2004/042346 PCT/US2003/0 12946 

the full-length gene sequences may not be known. Full-length sequences in these cases can be 
predicted using gene prediction algorithms. Alternatively the full-length can be determined by cloning 
and sequencing the full-length gene or genes that contain the sequence of interest using standard 
molecular biology approaches described here. The same is true for olignonucleotides designed from 
our sequencing of cDNA libraries where the cDNA does not match any sequence in the public 
databases. 

The diagnostic nucleotides may also be derived from other genes that are coexpressed with the 
correlated sequence or full-length gene. Genes may share expression patterns because they are 
regulated in the same molecular pathway. Because of the similarity of expression behavior genes are 
identified as surrogates in that they can substitute for a diagnostic gene in a diagnostic gene set. 
Example 4 demonstrates the discovery of surrogates from the data and the sequence listing identifies 
and gives the sequence for surrogates for cardiac diagnostic genes. 

As used herein the term "gene cluster" or "cluster" refers to a group of genes related by 
expression pattern. In other words, a cluster of genes is a group of genes with similar regulation across 
different conditions, such as graft non-rejection verus graft rejection. The expression profile for each 
gene in a cluster should be correlated with the expression profile of at least one other gene in that 
cluster. Correlation may be evaluated using a variety of statistical methods. As used herein the term 
"surrogate" refers to a gene with an expression profile such that it can substitute for a diagnostic gene 
in a diagnostic assay. Such genes are often members of the same gene cluster as the diagnostic gene. 
For each member of a diagnostic gene set, a set of potential surrogates can be identified through 
identification of genes with similar expression patterns as described below. 

Many statistical analyses produce a correlation coefficient to describe the relatedness between 
two gene expression patterns. Patterns may be considered correlated if the correlation coefficient is 
greater than or equal to 0.8. In preferred embodiments, the correlation coefficient should be greater 
than 0.85, 0.9 or 0.95. Other statistical methods produce a measure of mutual information to describe 
the relatedness between two gene expression patterns. Patterns may be considered correlated if the 
normalized mutual information value is greater than or equal to 0.7. In preferred embodiments, the 
normalized mutual information value should be greater than 0.8, 0.9 or 0.95. Patterns may also be 
considered similar if they cluster closely upon hierarchical clustering of gene expression data (Eisen et 
al. 1998). Similar patterns may be those genes that are among the 1, 2, 5, 10, 20, 50 or 100 nearest 
neighbors in a hierarchical clustering or have a similarity score (Eisen et al. 1998) of > 0.5, 0.7, 0.8, 
0.9, 0.95 or 0.99. Similar patterns may also be identified as those genes found to be surrogates in a 
classification tree by CART (Breiman et al. 1994). Often, but not always, members of a gene cluster 
have similar biological functions in addition to similar gene expression patterns. 

Correlated genes, clusters and surrogates are identified for the diagnostic genes of the 
. invention. These surrogates may be used as diagnostic genes in an assay instead of, or in addition to, 
the diagnostic genes for which they are surrogates. 

The invention also provides diagnostic probe sets. It is understood that a probe includes any 
reagent capable of specifically identifying a nucleotide sequence of the diagnostic nucleotide set, 
including but not limited to amplified DNA, amplified RNA, cDNA, synthetic oligonucleotide, partial 
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or full-length nucleic acid sequences. In addition, the probe may identify the protein product of a 
diagnostic nucleotide sequence, including, for example, antibodies and other affinity reagents. 

It is also understood that each probe can correspond to one gene, or multiple probes can 
correspond to one gene, or both, or one probe can correspond to more than one gene. 

Homologs and variants of the disclosed nucleic acid molecules may be used in the present 
invention. Homologs and variants of these nucleic acid molecules will possess a relatively high degree 
of sequence identity when aligned using standard methods. The sequences encompassed by the 
invention have at least 40-50, 50-60, 70-80, 80-85, 85-90, 90-95 or 95-100% sequence identity to the 
sequences disclosed herein. 

It is understood that for expression profiling, variations in the disclosed sequences will still 
permit detection of gene expression. The degree of sequence identity required to detect gene 
expression varies depending on the length of the oligomer. For a 60 mer, 6-8 random mutations or 6-8 
random deletions in a 60 mer do not affect gene expression detection. Hughes, TR, et al. "Expression 
profiling using microarrays fabricated by an ink-jet oligonucleotide synthesizer. Nature Biotechnology, 
19:343-347(2001). As the length of the DNA sequence is increased, the number of mutations or 
deletions permitted while still allowing gene expression detection is increased. 

As will be appreciated by those skilled in the art, the sequences of the present invention may 
contain sequencing errors. That is, there may be incorrect nucleotides, frameshifts, unknown 
nucleotides, or other types of sequencing errors in any of the sequences; however, the correct 
sequences will fall within the homology and stringency definitions herein. 

The minimum length of an oligonucleotide probe necessary for specific hybridization in the 
human genome can be estimated using two approaches. The first method uses a statistical argument 
that the probe will be unique in the human genome by chance. Briefly, the number of independent 
perfect matches (Po) expected for an oligonucleotide of length L in a genome of complexity C can be 
calculated from the equation (Laird CD, Chromosoma 32:378 (1971): 
Po=<l/4) L * 2C 

In the case of mammalian genomes, 2C = ~3,6 X 10 9 , and an oligonucleotide of 14-15 
nucleotides is expected to be represented only once in the genome. However, the distribution of 
nucleotides in the coding sequence of mammalian genomes is nonrandom (Lathe, R. J. Mol. Biol. 
183:1 (1985) and longer oligonucleotides may be preferred in order to in increase the specificity of 
hybridization. In practical terms, this works out to probes that are 19-40 nucleotides long (Sambrook J 
et al., infra). The second method for estimating the length of a specific probe is to use a probe long 
enough to hybridize under the chosen conditions and use a computer to search for that sequence or 
close matches to the sequence in the human genome and choose a unique match. Probe sequences are 
chosen based on the desired hybridization properties as described in Chapter 1 1 of Sambrook et al, 
infra. The PRJMER3 program is useful for designing these probes (S. Rozen and H. Skaletsky 
1996,1 997; Primer3 code available at the web site located at 

genome.wi.rrut.edu/genome_soirware/other/primer3.htrnl). The sequences of these probes are then 
compared pair wise against a database of the human genome sequences using a program such as 
BLAST or MEGABLAST (Madden, T.L et al(1996) Meth. Enzymol. 266:1 31-141). Since most of the 
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human genome is now contained in the database, the number of matches will be determined. Probe 
sequences are chosen that are unique to the desired target sequence. 

In some embodiments, a diagnostic probe set is immobilized on an array. The array is 
optionally comprises one or more of: a chip array, a plate array, a bead array, a pin array, a membrane 
array, a solid surface array, a liquid array, an oligonucleotide array, a polynucleotide array or a cDNA 
array, a microliter plate, a pin array, a bead array, a membrane or a chip. 

In some embodiments, the leukocyte- implicated disease is selected from the diseases listed in 
Table 1. In other embodiments, In some embodiments, the disease is atherosclerosis or cardiac 
allograft rejection. In other embodiments, the disease is congestive heart failure, angina, and 
myocardial infarction. 

In some embodiments, diagnostic nucleotides of the invention are used as a diagnostic gene 
set in combination with genes that are know to be associated with a disease state ("known markers")- 
The use of the diagnostic nucleotides in combination with the known markers can provide information 
that is not obtainable through the known markers alone. The known markers include those identified 
by the prior art listing provided. 
Hematopoeisis 

The present invention is also directed to methods of measurement of the rate of hematopoiesis 
using the diagnostic oligonucleotides of the invention and measurement of the rates of hematopoesis by 
any technique as a method for the monitoring and diagnosis of transplant rejection. Precursor and 
immature cells often have cell specific phenotypic markers. These are genes and/or proteins that 
expressed in a restricted manner in immature or precursor cells. This expression decreases with 
maturation. Gene expression markers for immature cells of a variety of lineages are given in Table 10 



below by way of example. 
Table 10: 



Gene 


Celt type 


CD10 


B-lymphoblasts 


RAG1 


B-lymphoblasts 


RAG2 


B-lymphoblasts 


NF-E2 


Plat elets/M egakary ocy te/E ry t hrold 


GATA-1 


Platelets/Megakaryocyte 


GP lib 


Platelets 


pf4 


Platelets 


EPO-R 


Erythroblast 


Band 4.1 


Erythrocyte 


ALAS2 


Erythroid specific heme biosynthesis 


hemoglobin chains 


Erythocyte 


2,3-BPG mutase 


Erythrocyte 


CD16b 


Neutrophil 


LAP 


Neutrophil 


CD16 


NK cells 


CD159a 


NK cells 1 



By measuring the levels of these and other genes in peripheral blood samples, an assessment 
of the number and proportion of immature or precursor cells can be made. Of particular use is RNA 
quantification in erythrocytes and platelets. These cells are anucleated in their mature forms. During 
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development, platelets pinch off of a megakaryocyte and take a compliment of RNA without a nucleus. 
This RNA is quickly consumed by the platelet. Erythrocytes start as nucleated cells, but the nucleus 
extrudes toward the end of the maturation process. These cells have RNA which is rapidly consumed 
within the first 2 days of the cells 120 day life span. 

For these anucleated cell types, gene expression markers must be specific only to the cell line 
(and not the immature form) to be useful as measures of cellular production rates. Genes specific to the 
lineage vs. other blood cell types will serve as markers of cellular production rates when measured on 
the RNA level. This is because RNA is specific to immature forms in these cases. For example, 
hemoglobin is specific to erythrocytes, but hemoglobin RNA is specific to newly produced 
erythrocytes. Therefore, if the rate of production of erythrocytes increases, so will the level of a 
lineage specific RNA (e.g., hemoglobin). 

Hematopoietic growth factors and cytokines have incomplete lineage specificity. G-CSF is 
administered to patient with low granulocyte counts and the effect is a stimulation of all lineages 
(granulocytes, erythrocytes, platelets, etc. . .). Hemolytic anemia leads to increased production of 
multiple cell lineages although the only lineage in increased demand is the erythrocyte. Because of this 
lack of specificity of hematopoietic responses, erythrocyte and platelet production rates may serve as 
surrogates of increased production of lymphocyte lineages. Using RBCs and platelets production rates 
as surrogates for lymphocyte lineages may be useful because of the lack of a nucleus in these cells and 
the ease of measuring cellular production rates by simply measuring lineage specific RNA levels. 

Hematopoieis rates can be measured using gene expression profiling of peripheral blood. 
RBC and platelet specific genes provide unique opportunity for this because of their lack of a nucleus 
and kinetics. New cells = new / much more RNA from these cell types in peripheral blood. Immature 
lymphocytes may be even more specific for immune activation and rejection. Cell specific markers of 
lymphocyte precursors were identified (aka lymphoblasts) see below. Granulocyte precursors and 
markers of megakaryocytes or premature forms of any blood cells may be useful in this regard. 
Applications for measuring the rate of hematopoiesis 

Changes in the rate of hematopoiesis have been correlated to a number of disease states and 
other pathologies. Examples of such conditions are listed in Table 1 1 . One of skill in the art would be 
aware of other such conditions. In addition, one aspect of the present invention is the identification of 
the linkage between changes in the rate of hematopoiesis. The methods of the present invention 
directed to measuring the rates of hematopoiesis can therefore be applied to the diagnosis and 
monitoring of a number of disease states and other pathologies. In addition, these methods can be 
beneficial in determining appropriate therapies for patients. 



Table: 11 



Disorder / condition 


Cell type 


Cell 

production 


Therapy 


Anemia - Iron 
Deficiency 


Erythrocyte 


Decreased 


Iron 


Anemia - B12, Folate 
deficiency 


Erythrocyte 


Decreased 


B12, Folate 


Anemia - Aplastic 


Erythrocyte 


Decreased 


Epogen, transfusion 
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Anemia - hemolytic 


Erythrocyte 


Increased 


Immunosuppression, 
Splenectomy 


Anemia - Renal failure 


Erythrocyte 


Decreased 


Erythropoietin 


Anemia - Chronic 
disease 


Erythrocyte 


Decreased 


Treat underlying cause 


Polycythemia rubra 
vera 


Erythrocyte 


Increased 




Idiophic 

Thrrombocytopenic 
purpura 


Platelet 


Increased 


Immunosuppression, ! 
Splenectomy 


Thrombotic 

Thrombocytopenic 

purpura 


Platelet 


Increased or 
decreased 


Immunosuppression, 
plasmapheresis 


Essential 
thrombocytosis 


Platelet 


Increased 




Leukemia 


All lineages, 
variable 


Increase, 
decreased or 
abnomal 


Chemotherapy, BMT 


Cytopenias due to 
immunosupression 


All lineages, 
variable 


Decreased 


Epo, neupogen 


Cytopenias due to 
Chemotherapy 


All lineages, 
variable 


Decreased 


Epo, GCSF, GMCSF 


GVHD 


All lineages, 
variable 


Decreased 


Immunosuppression 


Myelodysplasia 


All lineages, 
variable 


Decreased, 
increased or 
abnormal 


Chemo? 


Allograft rejection 


Lymphocytes, 
All lineages 


Increased 


Immunosuppression 


Autoimmune diseases 
(many) 


Lymphocytes, 
All lineages 


Increased 


Immunosuppression 



The methods of the present invention are also useful for monitoring treatment regimens of 
diseases or other pathologies which are correlated with changes in the rate of hematopoiesis. 
Furthermore, the methods may be used to monitor treatment with agents that affect the rate of 
hematopoiesis. One of skill in the art is aware of many such agents. The following agents are 
examples of such. 

Erythropoietin is a growth factor that is used to treat a variety of anemias that are due to 
decreased red cell production. Monitoring of red cell production by gene expression or other means 
may improve dosing and provide a means for earlier assessment of response to therapy for this 
expensive drug. 

Neupogen (G-CSF) is used for the treatment of low neutrophil counts (neutropenia) usually 
related to immunosuppression or chemotherapy. Monitoring neutrophil production by gene expression 
testing or another means may improve dosing, patient selection, and shorten duration of therapy. 

Prednisone / Immunosuppression - One of most common side effects of immunosuppression 
is suppression of hematopoiesis. This may occur in any cell lineage. Gene expression monitoring or 
other measures of hematopoietic rates could be used to monitor regularly for cytopenias in a particular 
cell line and the information could be used to modify dosing, modify therapy or add a specific 
hematologic growth factor. Following cell counts themselves is less sensitive and results in the need 
for prolonged trials of therapies at a given dose before efficacy and toxicity can be assessed. 
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Monitoring of chemotherapeutic agents -Most chemotherapy agents suppress the bone 
marrow for some or all lineages. Gene expression testing or other means of assessing hematopoietic 
rates could be used to monitor regularly for cytopenias in a particular cell line and use information to 
modify dosing, modify therapy or add a specific hematologic growth factor. 
General Molecular Biology References 

In the context of the invention, nucleic acids and/or proteins are manipulated according to well 
known molecular biology techniques. Detailed protocols for numerous such procedures are described 
in, e.g., in Ausubel et al. Current Protocols in Molecular Biology (supplemented through 2000) John 
Wiley & Sons, New York ("Ausubel"); Sambrook et al. Molecular Cloning - A Laboratory Manual 
(2nd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989 
("Sambrook"), and Berger and Kimmel Guide to Molecular Cloning Techniques, Methods in 
Enzymologv volume 152 Academic Press, Inc., San Diego, CA ("Berger"). 

In addition to the above references, protocols for in vitro amplification techniques, such as the 
polymerase chain reaction (PCR), the ligase chain reaction (LCR), Q-replicase amplification, and other 
RNA polymerase mediated techniques (e.g., NASBA), useful e.g., for amplifying cDNA probes of the 
invention, are found in Mullis et al. (1987) U.S. Patent No. 4,683,202; PCR Protocols A Guide to 
Methods and Applications (Innis et al. eds) Academic Press Inc. San Diego, CA (1990) ("Innis"); 
Arnheim and Levinson (1990) C&EN 36; The Journal Of NIH Research (1991) 3:81 ; Kwoh et al. 

(1989) Proc Natl Acad Sci USA 86, 1 173; Guatelli et al. (1990) Proc Natl Acad Sci USA 87:1874; 
Lomell et al. (1989) J Clin Chem 35:1826; Landegren et al. (1988) Science 241:1077; Van Brunt 

(1990) Biotechnology 8:291 ; Wu and Wallace (1989) Gene 4: 560; Barringer et al. (1990) Gene 
89:1 17, and Sooknanan and Malek (1995) Biotechnology 13:563. Additional methods, useful for 
cloning nucleic acids in the context of the present invention, include Wallace et al. U.S. Pat. No. 
5,426,039. Improved methods of amplifying large nucleic acids by PCR are summarized in Cheng et 
al. (1994) Nature 369:684 and the references therein. 

Certain polynucleotides of the invention, e.g., oligonucleotides can be synthesized utilizing 
various solid-phase strategies involving mononucleotide- and/or trinucleo tide-based phosphoramidite 
coupling chemistry. For example, nucleic acid sequences can be synthesized by the sequential addition 
of activated monomers and/or trimers to an elongating polynucleotide chain. See e.g., Caruthers, M.H. 
et al. (1992) Meth Enzvmol 21 1:3. 

In lieu of synthesizing the desired sequences, essentially any nucleic acid can be custom 
ordered from any of a variety of commercial sources, such as The Midland Certified Reagent 
Company, The Great American Gene Company ExpressGen, Inc., Operon Technologies, Inc. and 
many others. 

Similarly, commercial sources for nucleic acid and protein microarrays are available, and 
include, e.g., Agilent Technologies, Palo Alto, CA Affymetrix, Santa Clara,CA ; and others. 

One area of relevance to the present invention is hybridization of oligonucleotides. Those of 
skill in the art differentiate hybridization conditions based upon the stringency of hybridization. For 
example, highly stringent conditions could include hybridization to filter-bound DNA in 0.5 M 
NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65° C, and washing in 0.1XSSC/0.1% 
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SDS at 68° C. (Ausubel F. M. et al., eds., 1989, Current Protocols in Molecular Biology, Vol. I, Green 
Publishing Associates, Inc., and John Wiley & sons, Inc., New York, at p. 2.10.3). Moderate 
stringency conditions could include, e.g., washing in 0.2XSSC/0.I% SDS at 42°C. (Ausubel et al., 
1989, supra). 

The invention also includes nucleic acid molecules, preferably DNA molecules, that hybridize to, and 
are therefore the complements of, the DNA sequences of the present invention. Such hybridization 
conditions may be highly stringent or less highly stringent, as described above. In instances wherein 
the nucleic acid molecules are deoxyoligonucleotides ("oligos"), highly stringent conditions may refer, 
e.g., to washing in 6xSSC/0.05% sodium pyrophosphate at 37°C (for 14-base oligos), 48°C (for 17- 
base oligos), 55°C. (for 20-base oligos), and 60°C. (for 23-base oligos). These nucleic acid molecules 
may act as target nucleotide sequence antisense molecules, useful, for example, in target nucleotide 
sequence regulation and/or as antisense primers in amplification reactions of target nucleotide sequence 
nucleic acid sequences. Further, such sequences may be used as part of ribozyme and/or triple helix 
sequences, also useful for target nucleotide sequence regulation. Still further, such molecules may be 
used as components of diagnostic methods whereby the presence of a disease-causing allele, may be 
detected. 

Identification of diagnostic nucleotide sets 
Candidate library 

Libraries of candidates that are differentially expressed in leukocytes are substrates for the 
identification and evaluation of diagnostic oligonucleotide sets and disease specific target nucleotide 
sequences. 

The term leukocyte is used generically to refer to any nucleated blood cell that is not a 
nucleated erythrocyte. More specifically, leukocytes can be subdivided into two broad classes. The 
first class includes granulocytes, including, most prevalently, neutrophils, as well as eosinophils and 
basophils at low frequency. The second class, the non-granular or mononuclear leukocytes, includes 
monocytes and lymphocytes (e.g., T cells and B cells). There is an extensive literature in the art 
implicating leukocytes, e.g., neutrophils, monocytes and lymphocytes in a wide variety of disease 
processes, including inflammatory and rheumatic diseases, neurodegenerative diseases (such as 
Alzheimer's dementia), cardiovascular disease, endocrine diseases, transplant rejection, malignancy 
and infectious diseases, and other diseases listed in Table 1. Mononuclear cells are involved in the 
chronic immune response, while granulocytes, which make up approximately 60% of the leukocytes, 
have a non-specific and stereotyped response to acute inflammatory stimuli and often have a life span 
of only 24 hours. 

In addition to their widespread involvement and/or implication in numerous disease related 
processes, leukocytes are particularly attractive substrates for clinical and experimental evaluation for a 
variety of reasons. Most importantly, they are readily accessible at low cost from essentially every 
potential subject. Collection is minimally invasive and associated with little pain, disability or 
recovery time. Collection can be performed by minimally trained personnel (e.g., phlebotomists, 
medical technicians, etc.) in a variety of clinical and non-clinical settings without significant 
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technological expenditure. Additionally, leukocytes are renewable, and thus available at multiple time 
points for a single subject. 

Assembly of an initial candidate library 
The initial candidate library was assembled from a combination of "mining" publication and sequence 
databases and construction of a differential expression library. Candidate oligonucleotide sequences in 
the library may be represented by a full-length or partial nucleic acid sequence, deoxyribonucleic acid 
(DMA) sequence, cDNA sequence, RNA sequence, synthetic oligonucleotides, etc. The nucleic acid 
sequence can be at least 19 nucleotides in length, at least 25 nucleotides, at least 40 nucleotides, at least 
100 nucleotides, or larger. Alternatively, the protein product of a candidate nucleotide sequence may 
be represented in a candidate library using standard methods, as further described below. In selecting 
and validatating diagnostic oligonucleotides, an initial library of 8,031 candidate oligonucleotide 
sequences using nucleic acid sequences of 50 nucleotides in length was constructed as described below. 

Candidate nucleotide library of the inyention 
We identified members of an initial candidate nucleotide library that are differentially 
expressed in activated leukocytes and resting leukocytes. From that initial candidate nucleotide library, 
a pool of candidates was selected as listed in Table 2, Table 8, and the seuqnce listing. Accordingly, 
the invention provides the candidate leukocyte nucleotide library comprising the nucleotide sequences 
listed in Table 2, Table 8, and in the sequence listing. In another embodiment, the invention provides 
an candidate library comprising at least one nucleotide sequence listed in Tables 2 and 8 and the 
sequence listing. In another embodiment, the invention provides an candidate library comprising at 
least two nucleotide sequences listed in Tables 2 and 8 and the sequence listing. In another 
embodiment, the at least two nucleotide sequence are at least 19 nucleotides in length, at least 35 
nucleotides, at least 40 nucleotides or at least 100 nucleotides. In some embodiments, the nucleotide 
sequences comprises deoxyribonucleic acid (DNA) sequence, ribonucleic acid (RNA) sequence, 
synthetic oligonucleotide sequence, or genomic DNA sequence. It is understood that the nucleotide 
sequences may each correspond to one gene, or that several nucleotide sequences may correspond to 
one gene, or both. 

The invention also provides probes to the candidate nucleotide library. In one embodiment of 
the invention, the probes comprise at least two nucleotide sequences listed in Table 2, Table 8, or the 
sequence listing which are differentially expressed in leukocytes in an individual with a least one 
disease criterion for at least one leukocyte-related disease and in leukocytes in an individual without 
the at least one disease criterion, wherein expression of the two or more nucleotide sequences is 
correlated with at least one disease criterion. It is understood that a probe may detect either the RNA 
expression or protein product expression of the candidate nucleotide library. Alternatively, or in 
addition, a probe can detect a genotype associated with a candidate nucleotide sequence, as further 
described below. In another embodiment, the probes for the candidaten ucleotide library are 
immobilized on an array. 

The candidate nucleotide library of the invention is useful in identifying diagnostic nucleotide 
sets of the invention and is itself a diagnostic nucleotide set of the invention, as described below. The 
candidate nucleotide sequences may be further characterized, and may be identified as a disease target 
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nucleotide sequence and/or a novel nucleotide sequence, as described below. The candidate nucleotide 
sequences may also be suitable for use as imaging reagents, as described below. 
Detection of non-leukocyte expressed genes 

When measuring gene expression levels in a blood sample, RNAs may be measured that are 
not derived from leukocytes. Examples are viral genes, free RNAs that have been released from 
damaged non-leukocyte cell types or RNA from circulating non-leukocyte cell types. For example, in 
the process of acute allograft rejection, tissue damage may result in release of allograft cells or RNAs 
derived from allograft cells into the circulation. In the case of cardiac allografts, such transcripts may 
be specific to muscle (myoglobin) or to cardiac muscle (Troponin I, Toponin T, CK-MB). Presence of 
cardiac specific mRNAs in peripheral blood may indicate ongoing or recent cardiac cellular damage 
(resulting from acute rejection). Therefore, such genes may be excellent diagnostic markers for 
allograft rejection. 

Generation of Expression Patterns 

RNA. DN A or protein sample procurement 

Following identification or assembly of a library of differentially expressed candidate 
nucleotide sequences, leukocyte expression profiles corresponding to multiple members of the 
candidate library are obtained. Leukocyte samples from one or more subjects are obtained by standard 
methods. Most typically, these methods involve trans-cutaneous venous sampling of peripheral blood. 
While sampling of circulating leukocytes from whole blood from the peripheral vasculature is 
generally the simplest, least invasive, and lowest cost alternative, it will be appreciated that numerous 
alternative sampling procedures exist, and are favorably employed in some circumstances. No 
pertinent distinction exists, in fact, between leukocytes sampled from the peripheral vasculature, and 
those obtained, e.g., from a central line, from a central artery, or indeed from a cardiac catheter, or 
during a surgical procedure which accesses the central vasculature. In addition, other body fluids and 
tissues that are, at least in part, composed of leukocytes are also desirable leukocyte samples. For 
example, fluid samples obtained from the lung during bronchoscopy may be rich in leukocytes, and 
amenable to expression profiling in the context of the invention, e.g., for the diagnosis, prognosis, or 
monitoring of lung transplant rejection, inflammatory lung diseases or infectious lung disease. Fluid 
samples from other tissues, e.g., obtained by endoscopy of the colon, sinuses, esophagus, stomach, 
small bowel, pancreatic duct, biliary tree, bladder, ureter, vagina, cervix or uterus, etc., are also 
suitable. Samples may also be obtained other sources containing leukocytes, e.g., from urine, bile, 
cerebrospinal fluid, feces, gastric or intestinal secretions, semen, or solid organ or joint biopsies. 

Most frequently, mixed populations of leukocytes, such as are found in whole blood are 
utilized in the methods of the present invention. A crude separation, e.g., of mixed leukocytes from red 
blood cells, and/or concentration, e.g., over a sucrose, percoll or ficoll gradient, or by other methods 
known in the art, can be employed to facilitate the recovery of RNA or protein expression products at 
sufficient concentrations, and to reduce non-specific background. In some instances, it can be desirable 
to purify sub-populations of leukocytes, and methods for doing so, such as density or affinity gradients, 
flow cytometry, fluorescence Activated Cell Sorting (FACS), immuno-magnetic separation, "panning," 
and the like, are described in the available literature and below. 
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Obtaining DNA. RNA and protein samples for expression profiling 

Expression patterns can be evaluated at the level of DNA, or RNA or protein products. For 
example, a variety of techniques are available for the isolation of RNA from whole blood. Any 
technique that allows isolation of mRNA from cells (in the presence or absence of rRNA and tRNA) 
can be utilized. In brief, one method that allows reliable isolation of total RNA suitable for subsequent 
gene expression analysis, is described as follows. Peripheral blood (either venous or arterial) is drawn 
from a subject, into one or more sterile, endotoxin free, tubes containing an anticoagulant (e.g., EDTA, 
citrate, heparin, etc.). Typically, the sample is divided into at least two portions. One portion, e.g., of 
5-8 ml of whole blood is frozen and stored for future analysis, e.g., of DNA or protein. A second 
portion, e.g., of approximately 8 ml whole blood is processed for isolation of total RNA by any of a 
variety of techniques as described in, e.g, Sambook, Ausubel, below, as well as U.S. Patent Numbers: 
5,728,822 and 4,843,155. 

Typically, a subject sample of mononuclear leukocytes obtained from about 8 ml of whole 
blood, a quantity readily available from an adult human subject under most circumstances, yields 5-20 
ug of total RNA. This amount is ample, e.g., for labeling and hybridization to at least two probe 
arrays. Labeled probes for analysis of expression patterns of nucleotides of the candidate libraries are 
prepared from the subject's sample of RNA using standard methods. In many cases, cDNA is 
synthesized from total RNA using a polyT primer and labeled, e.g., radioactive or fluorescent, 
nucleotides. The resulting labeled cDNA is then hybridized to probes corresponding to members of the 
candidate nucleotide library, and expression data is obtained for each nucleotide sequence in the 
library. RNA isolated from subject samples (e.g., peripheral blood leukocytes, or leukocytes obtained 
from other biological fluids and samples) is next used for analysis of expression patterns of nucleotides 
of the candidate libraries. 

In some cases, however, the amount of RNA that is extracted from the leukocyte sample is 
limiting, and amplification of the RNA is desirable. Amplification may be accomplished by increasing 
the efficiency of probe labeling, or by amplifying the RNA sample prior to labeling. It is appreciated 
that care must be taken to select an amplification procedure that does not introduce any bias (with 
respect to gene expression levels) during the amplification process. 

Several methods are available that increase the signal from limiting amounts of RNA, e.g. use 
of the Clontech (Glass Fluorescent Labeling Kit) or Stratagene (Fairplay Microarray Labeling Kit), or 
the Micromax kit (New England Nuclear, Inc.). Alternatively, cDNA is synthesized from RNA using a 
T7- polyT primer, in the absence of label, and DNA dendrimers from Genisphere (3DNA Submicro) 
are hybridized to the poly T sequence on the primer, or to a different "capture sequence" which is 
complementary to a fluorescently labeled sequence. Each 3DNA molecule has 250 fluorescent 
molecules and therefore can strongly label each cDNA. 

Alternatively, the RNA sample is amplified prior to labeling. For example, linear 
amplification may be performed, as described in U.S. Patent No. 6,132,997. A T7-polyT primer is 
used to generate the cDNA copy of the RNA. A second DNA strand is then made to complete the 
. substrate for amplification. The T7 promoter incorporated into the primer is used by a T7 polymerase 
to produce numerous antisense copies of the original RNA, Fluorescent dye labeled nucleotides are 
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directly incorporated into the RNA. Alternatively, amino ally! labeled nucleotides are incorporated 
into the RNA, and then fluorescent dyes are chemically coupled to the amino ally! groups, as described 
in Hughes. Other exemplary methods for amplification are described below. 

It is appreciated that the RNA isolated must contain RNA derived from leukocytes, but may 
also contain RNA from other cell types to a variable degree. Additionally, the isolated RNA may come 
from subsets of leukocytes, e.g. monocytes and/or T-lyrnphocytes, as described above. Such 
consideration of cell type used for the derivation of RNA depend on the method of expression profiling 
used. Subsets of leukocytes can be obtained by fluorescence activated cell sorting (FACS), 
microfluidics cell seperation systems or a variety of other methods. Cell sorting may be necessary for 
the discovery of diagnostic gene sets, for the implementation of gene sets as products or both. Cell 
sorting can be achieved with a variety of technologies (See Galbraith et al. 1999, Cantor et al. 1975, see 
also the technology of Guava Technologies, Hayward, CA). 

DNA samples may be obtained for analysis of the presence of DNA mutations, single 
nucleotide polymorphisms (SNPs), or other polymorphisms. DNA is isolated using standard 
techniques, e.g. Maniatus, supra. 

Expression of products of candidate nucleotides may also be assessed using proteomics. 
Protein(s) are detected in samples of patient serum or from leukocyte cellular protein. Serum is 
prepared by centrifugation of whole blood, using standard methods. Proteins present in the serum may 
have been produced from any of a variety of leukocytes and non-leukocyte cells, and include secreted 
proteins from leukocytes. Alternatively, leukocytes or a desired sub-population of leukocytes are 
prepared as described above. Cellular protein is prepared from leukocyte samples using methods well 
known in the art, e.g., Trizol (Invitrogen Life Technologies, cat # 15596108; Chomczynski, P. and 
Sacchi, N. (1987) Anal. Biochenx 162, 156; Siirtms, D., Cizdziel, P.E., and Chomczynski, P. (1993) 
Focus® 15, 99; Chomczynski, P., Bowers-Finn, R., and Sabatini, L. (1987) J. of NIH Res. 6, 83; 
Chomczynski, P. (1993) Bio/Techniques 15, 532; Bracete, A.M., Fox, D.K., and Sirnms, D. (1998) 
Focus 20, 82; Sewall, A. and McRae, S. (1998) Focus 20, 36; Anal Biochem 1984 Apr;138(l):141-3, A 
method for the quantitative recovery of protein in dilute solution in the presence of detergents and 
lipids; Wessel D, Flugge UI. (1984) Anal Biochenx 1984 Apr;138(l): 141-143. 

The assay itself may be a cell sorting assay in which cells are sorted and/or counted based on 
cell surface expression of a protein marker. (See Cantor et al. 1975, Galbraith et ah 1999) 
Obtaining expression patterns 

Expression patterns, or profiles, of a plurality of nucleotides corresponding to members of the 
candidate library are then evaluated in one or more samples of leukocytes. Typically, the leukocytes are 
derived from patient peripheral blood samples, although, as indicated above, many other sample 
sources are also suitable. These expression patterns constitute a set of relative or absolute expression 
values for a some number of RNAs or protein products corresponding to the plurality of nucleotide 
sequences evaluated, which is referred to herein as the subject's "expression profile" for those 
nucleotide sequences. While expression patterns for as few as one independent member of the 
candidate library can be obtained, it is generally preferable to obtain expression patterns corresponding 
to a larger number of nucleotide sequences, e.g., about 2, about 5, about 10, about 20, about 50, about 
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100, about 200, about 500, or about 1000, or more. The expression pattern for each differentially 
expressed component member of the library provides a finite specificity and sensitivity with respect to 
predictive value, e.g., for diagnosis, prognosis, monitoring, and the like. 
Clinical Studies. Data and Patient Groups 

For the purpose of discussion, the term subject, or subject sample of leukocytes, refers to an 
individual regardless of health and/or disease status. A subject can be a patient, a study participant, a 
control subject, a screening subject, or any other class of individual from whom a leukocyte sample is 
obtained and assessed in the context of the invention. Accordingly, a subject can be diagnosed with a 
disease, can present with one or more symptom of a disease, or a predisposing factor, such as a family 
(genetic) or medical history (medical) factor, for a disease, or the like. Alternatively, a subject can be 
healthy with respect to any of the aforementioned factors or criteria. It will be appreciated that the term 
"healthy" as used herein, is relative to a specified disease, or disease factor, or disease criterion, as the 
term "healthy" cannot be defined to correspond to any absolute evaluation or status. Thus, an 
individual defined as healthy with reference to any specified disease or disease criterion, can in fact be 
diagnosed with any other one or more disease, or exhibit any other one or more disease criterion. 

Furthermore, while the discussion of the invention focuses, and is exemplified using human 
sequences and samples, the invention is equally applicable, through construction or selection of 
appropriate candidate libraries, to non-human animals, such as laboratory animals, e.g., mice, rats, 
guinea pigs, rabbits; domesticated livestock, e.g., cows, horses, goats, sheep, chicken, etc.; and 
companion animals, e.g., dogs, cats, etc. 

Methods for obtaining expression data 

Numerous methods for obtaining expression data are known, and any one or more of these 
techniques, singly or in combination, are suitable for deteimining expression profiles in the context of 
the present invention. For example, expression patterns can be evaluated by northern analysis, PCR, 
RT-PCR, Taq Man analysis, FRET detection, monitoring one or more molecular beacon, hybridization 
to an oligonucleotide array, hybridization to a cDNA array, hybridization to a polynucleotide array, 
hybridization to a liquid microarray, hybridization to a microelectric array, molecular beacons, cDNA 
sequencing, clone hybridization, cDNA fragment fingerprinting, serial analysis of gene expression 
(SAGE), subtractive hybridization, differential display and/or differential screening (see, e.g., Lockhart 
and Winzeler (2000) Nature 405:827-836, and references cited therein). 

For example, specific PCR primers are designed to a member(s) of an candidate nucleotide 
library. cDNA is prepared from subject sample RNA by reverse transcription from a poly-dT 
oligonucleotide primer, and subjected to PCR. Double stranded cDNA may be prepared using primers 
suitable for reverse transcription of the PCR product, followed by amplification of the cDNA using in 
vitro transcription. The product of in vitro transcription is a sense-RN A corresponding to the original 
member(s) of the candidate library. PCR product may be also be evaluated in a number of ways known 
in the art, including real-time assessment using detection of labeled primers, e.g. TaqMan or molecular 
beacon probes. Technology platforms suitable for analysis of PCR products include the ABI 7700, 
5700, or 7000 Sequence Detection Systems (Applied Biosystems, Foster City, CA), the MJ Research 
Opticon (MJ Research, Waltham, MA), the Roche Light Cycler (Roche Diagnositics, Indianapolis, IN), 
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the Stratagene MX4000 (Stratagene, La Jolla, CA), and the Bio-Rad iCycler (Bio-Rad Laboratories, 
Hercules, CA). Alternatively, molecular beacons are used to detect presence of a nucleic acid sequence 
in an unamplified RNA or cDNA sample, or following amplification of the sequence using any 
method, e.g. IVT (In Vitro transcription) or NASBA (nucleic acid sequence based amplification). 
Molecular beacons are designed with sequences complementary to member(s) of an candidate 
nucleotide library, and are linked to fluorescent labels. Each probe has a different fluorescent label 
with non-overlapping emission wavelengths. For example, expression often genes may be assessed 
using ten different sequence-specific molecular beacons. 

Alternatively, or in addition, molecular beacons are used to assess expression of multiple 
nucleotide sequences at once. Molecular beacons with sequence complimentary to the members of a 
diagnostic nucleotide set are designed and linked to fluorescent labels. Each fluorescent label used 
must have a non-overlapping emission wavelength. For example, 10 nucleotide sequences can be 
assessed by hybridizing 10 sequence specific molecular beacons (each labeled with a different 
fluorescent molecule) to an amplified or un-amplified RNA or cDNA sample. Such an assay bypasses 
the need for sample labeling procedures. 

Alternatively, or in addition bead arrays can be used to assess expression of multiple 
sequences at once. See, e.g, LabMAP 100, Luminex Corp, Austin, Texas). Alternatively, or in 
addition electric arrays are used to assess expression of multiple sequences, as exemplified by the e- 
Sensor technology of Motorola (Chicago, 111.) or Nanochip technology of Nanogen (San Diego, CA.) 

Of course, the particular method elected will be dependent on such factors as quantity of RNA 
recovered, practitioner preference, available reagents and equipment, detectors, and the like. Typically, 
however, the elected method(s) will be appropriate for processing the number of samples and probes of 
interest. Methods for high-throughput expression analysis are discussed below. 

Alternatively, expression at the level of protein products of gene expression is performed. For 
example, protein expression, in a sample of leukocytes, can be evaluated by one or more method 
selected from among: western analysis, two-dimensional gel analysis, chromatographic separation, 
mass spectrometric detection, protein-fusion reporter constructs, colorimetric assays, binding to a 
protein array and characterization of polysomal mRNA. One particularly favorable approach involves 
binding of labeled protein expression products to an array of antibodies specific for members of the 
candidate library. Methods for producing and evaluating antibodies are widespread in the art, see, e.g., 
Coligan, supra; and Harlow and Lane (1989) Antibodies: A Laboratory Manual. Cold Spring Harbor 
Press, NY ("Harlow and Lane"). Additional details regarding a variety of immunological and 
immunoassay procedures adaptable to the present invention by selection of antibody reagents specific 
for the products of candidate nucleotide sequences can be found in, e.g., Stites and Terr (eds.)(1991) 
Basic and Clinical Immunology. 7 th ed., and Paul, supra. Another approach uses systems for 
performing desorption spectrometry. Commercially available systems, e.g., from Ciphergen 
Biosys terns, Inc. (Fremont, CA) are particularly well suited to quantitative analysis of protein 
expression. Indeed, Protein Chip® arrays (see, e.g., the web site ciphergen.com) used in desorption 
spectrometry approaches provide arrays for detection of protein expression. Alternatively, affinity 
reagents, e.g., antibodies, small molecules, etc.) are developed that recognize epitopes of the protein 
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product. Affinity assays are used in protein array assays, e.g. to detect the presence or absence of 
particular proteins. Alternatively, affinity reagents are used to detect expression using the methods 
described above. In the case of a protein that is expressed on the cell surface of leukocytes, labeled 
affinity reagents are bound to populations of leukocytes, and leukocytes expressing the protein are 
identified and counted using fluorescent activated cell sorting (FACS). 

It is appreciated that the methods of expression evaluation discussed herein, although 
discussed in the context of discovery of diagnostic nucleotide sets, are equally applicable for 
expression evaluation when using diagnostic nucleotide sets for, e.g. diagnosis of diseases, as further 
discussed below. 

High Throughput Expression Assays 

A number of suitable high throughput formats exist for evaluating gene expression. Typically, 
the term high throughput refers to a format that performs at least about 100 assays, or at least about 500 
assays, or at least about 1000 assays, or at least about 5000 assays, or at least about 10,000 assays, or 
more per day. When enumerating assays, either the number of samples or the number of candidate 
nucleotide sequences evaluated can be considered. For example, a northern analysis of, e.g., about 100 
samples performed in a gridded array, e.g., a dot blot, using a single probe corresponding to an 
candidate nucleotide sequence can be considered a high throughput assay. More typically, however, 
such an assay is performed as a series of duplicate blots, each evaluated with a distinct probe 
corresponding to a different member of the candidate library. Alternatively, methods that 
simultaneously evaluate expression of about 100 or more candidate nucleotide sequences in one or 
more samples, or in multiple samples, are considered high throughput. 

Numerous technological platforms for performing high throughput expression analysis are 
known. Generally, such methods involve a logical or physical array of either the subject samples, or 
the candidate library, or both. Common array formats include both liquid and solid phase arrays. For 
example, assays employing liquid phase arrays, e.g., for hybridization of nucleic acids, binding of 
antibodies or other receptors to ligand, etc., can be performed in multiwell, or microtiter, plates. 
Microtiter plates with 96, 384 or 1536 wells are widely available, and even higher numbers of wells, 
e.g, 3456 and 9600 can be used. In general, the choice of microtiter plates is determined by the 
methods and equipment, e.g., robotic handling and loading systems, used for sample preparation and 
analysis. Exemplary systems include, e.g., the ORCA™ system from Beckman-Coulter, Inc. 
(Fullerton, CA) and the Zymate systems from Zymark Corporation (Hopkinton, MA). 

Alternatively, a variety of solid phase arrays can favorably be employed in to determine 
expression patterns in the context of the invention. Exemplary formats include membrane or filter 
arrays (e.g, nitrocellulose, nylon), pin arrays, and bead arrays (e.g., in a liquid "slurry"). Typically, 
probes corresponding to nucleic acid or protein reagents that specifically interact with (e.g., hybridize 
to or bind to) an expression product corresponding to a member of the candidate library are 
immobilized, for example by direct or. indirect cross-linking, to the solid support. Essentially any solid 
support capable of withstanding the reagents and conditions necessary for performing the particular 
expression assay can be utilized. For example, functionalized glass, silicon, silicon dioxide, modified 
silicon, any of a variety of polymers, such as (poly)terrafluoroethylene, (poly)vinylidenedifluoride, 
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polystyrene, polycarbonate, or combinations thereof can all serve as the substrate for a solid phase 
array. 

In a preferred embodiment, the array is a "chip" composed, e.g., of one of the above specified 
materials. Polynucleotide probes, e.g., RNA or DNA, such as cDNA, synthetic oligonucleotides, and 
the like, or binding proteins such as antibodies, that specifically interact with expression products of 
individual components of the candidate library are affixed to the chip in a logically ordered manner, 
i.e., in an array. In addition, any molecule with a specific affinity for either the sense or anti-sense 
sequence of the marker nucleotide sequence (depending on the design of the sample labeling), can be 
fixed to the array surface without loss of specific affinity for the marker and can be obtained and 
produced for array production, for example, proteins that specifically recognize the specific nucleic 
acid sequence of the marker, ribozymes, peptide nucleic acids (PNA), or other chemicals or molecules 
with specific affinity. 

Detailed discussion of methods for linking nucleic acids and proteins to a chip substrate, are 
found in, e.g., US Patent No. 5,143,854 "LARGE SCALE PHOTOLITHOGRAPHIC SOLID PHASE 
SYNTHESIS OF POLYPEPTIDES AND RECEPTOR BINDING SCREENING THEREOF" to 
Pirrung et al., issued, September 1, 1992; US Patent No. 5,837,832 "ARRAYS OF NUCLEIC ACID 
PROBES ON BIOLOGICAL CHIPS" to Chee et al., issued November 17, 1998; US Patent No. 
6,087,1 12 "ARRAYS WITH MODIFIED OLIGONUCLEOTIDE AND POLYNUCLEOTIDE 
COMPOSITIONS" to Dale, issued July 11, 2000; US Patent No. 5,215,882 "METHOD OF 
IMMOBILIZING NUCLEIC ACID ON A SOLID SUBSTRATE FOR USE IN NUCLEIC ACID 
HYBRIDIZATION ASSAYS" to Bahl et al., issued June 1, 1993; US Patent No. 5,707,807 
"MOLECULAR INDEXING FOR EXPRESSED GENE ANALYSIS" to Kato, issued January 13, 
1998; US Patent No. 5,807,522 "METHODS FOR FABRICATING M1CROARRAYS OF 
BIOLOGICAL SAMPLES" to Brown et al., issued September 15, 1998; US Patent No. 5,958,342 
"JET DROPLET DEVICE" to Gamble et al., issued Sept. 28, 1999; US Patent 5,994,076 "METHODS 
OF ASSAYING DIFFERENTIAL EXPRESSION" to Chenchik et al., issued Nov. 30, 1999; US Patent 
No. 6,004,755 "QUANTITATIVE MICROARRAY HYBRIDIZATION ASSAYS" to Wang, issued 
Dec. 21, 1999; US Patent No. 6,048,695 "CHEMICALLY MODIFIED NUCLEIC ACIDS AND 
METHOD FOR COUPLING NUCLEIC ACIDS TO SOLID SUPPORT" to Bradley et al., issued April 
1 1, 2000; US Patent No. 6,060,240 "METHODS FOR MEASURING RELATIVE AMOUNTS OF 
NUCLEIC ACIDS IN A COMPLEX MIXTURE AND RETRIEVAL OF SPECIFIC SEQUENCES 
THEREFROM" to Kamb et al., issued May 9, 2000; US Patent No. 6,090,556 "METHOD FOR 
QUANTITATIVELY DETERMINING THE EXPRESSION OF A GENE" to Kato, issued July 18, 
2000; and US Patent 6,040,138 "EXPRESSION MONITORING BY HYBRIDIZATION TO HIGH 
DENSITY OLIGONUCLEOTIDE ARRAYS" to Lockhart et al., issued March 21, 2000 each of which 
are hereby incorporated by reference in their entirety,. 

For example, cDNA inserts corresponding to candidate nucleotide sequences, in a standard 
TA cloning vector are amplified by a polymerase chain reaction for approximately 30-40 cycles. The 
amplified PCR products are then arrayed onto a glass support by any of a variety of well known 
techniques, e.g., the VSL1PS™ technology described in US Patent No. 5,143,854. RNA, or cDNA 
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corresponding to RN A, isolated from a subject sample of leukocytes is labeled, e.g., with a fluorescent 
tag, and a solution containing the RNA (or cDNA) is incubated under conditions favorable for 
hybridization, with the "probe" chip. Following incubation, and washing to eliminate non-specific 
hybridization, the labeled nucleic acid bound to the chip is detected qualitatively or quantitatively, and 
the resulting expression profile for .the corresponding candidate nucleotide sequences is recorded. It is 
appreciated that the probe used for diagnostic purposes may be identical to the probe used during 
diagnostic nucleotide sequence discovery and validation. Alternatively, the probe sequence may be 
different than the sequence used in diagnostic nucleotide sequence discovery and validation. Multiple 
cDNAs from a nucleotide sequence that are non-overlapping or partially overlapping may also be used. 

In another approach, oligonucleotides corresponding to members of an candidate nucleotide 
library are synthesized and spotted onto an array. Alternatively, oligonucleotides are synthesized onto 
the array using methods known in the art, e.g. Hughes, et al. supra. The oligonucleotide is designed to 
be complementary to any portion of the candidate nucleotide sequence. In addition, in the context of 
expression analysis for, e.g. diagnostic use of diagnostic nucleotide sets, an oligonucleotide can be 
designed to exhibit particular hybridization characteristics, or to exhibit a particular specificity and/or 
sensitivity, as further described below. 

Hybridization signal may be amplified using methods known in the art, and as described 
herein, for example use of the Clontech kit (Glass Fluorescent Labeling Kit), Stratagene kit (Fairplay 
Microarray Labeling Kit), the Micromax kit (New England Nuclear, Inc.), the Genisphere kit (3DNA 
Submicro), linear amplification, e.g. as described in U.S. Patent No. 6,132,997 or described in Hughes, 
TR, et al., Nature Biotechnology, 19:343-347 (2001) and/or Westin et al. Nat Biotech. 18:199-204. 

Alternatively, fluorescently labeled cDNA are hybridized directly to the microarray using 
methods known in the art. For example, labeled cDNA are generated by reverse transcription using 
Cy3- and Cy5-conjugated deoxynucleotides, and the reaction products purified using standard methods. 
It is appreciated that the methods for signal amplification of expression data useful for identifying 
diagnostic nucleotide sets are also useful for amplification of expression data for diagnostic purposes. 

Microarray expression may be detected by scanning the microarray with a variety of laser or 
CCD-based scanners, and extracting features with numerous software packages, for example, Imagene 
(Biodiscovery), Feature Extraction (Agilent), Scanalyze (Eisen, M. 1999. SCANALYZE User Manual; 
Stanford Univ., Stanford, CA. Ver 2.32.), GenePix (Axon Instruments). 

In another approach, hybridization to microelectric arrays is performed, e.g. as described in 
Umek et al (2001) J Mol Diagn. 3:74-84. An affinity probe, e.g. DNA, is deposited on a metal surface. 
The metal surface underlying each probe is connected to a metal wire and electrical signal detection 
system. Unlabelled RNA or cDNA is hybridized to the array, or alternatively, RNA or cDNA sample 
is amplified before hybridization, e.g. by PCR. Specific hybridization of sample RNA or cDNA results 
in generation of an electrical signal, which is transmitted to a detector. See Westin (2000) Nat Biotech. 
18:199-204 (describing anchored multiplex amplification of a microelectronic chip array); Edman 
(1997) NAR 25:4907-14; Vignali (2000) J Immunol Methods 243:243-55. 

In another approach, a microfluidics chip is used for RNA sample preparation and analysis. 
This approach increases efficiency because sample preparation and analysis are streamlined. Briefly, 

70 



WO 2004/042346 



PC17US2003/012946 



micro flu idics may be used to sort specific leukocyte sub-populations prior to RNA preparation and 

analysis. Microfluidics chips are also useful for, e.g., RNA preparation, and reactions involving RNA 

(reverse transcription, RT-PCR). Briefly, a small volume of whole, anti-coagulated blood is loaded 

onto a microfluidics chip, for example chips available from Caliper (Mountain View, CA) or Nanogen 

(San Diego, CA.) A microfluidics chip may contain channels and reservoirs in which cells are moved 
... . . . .- . 

and reactions are performed. Mechanical, electrical, magnetic, gravitational, centrifugal or other forces 

are used to move the cells and to expose them to reagents. For example, cells of whole blood are 

moved into a chamber containing hypotonic saline, which results in selective lysis of red blood cells 

after a 20-minute incubation. Next, the remaining cells (leukocytes) are moved into a wash chamber 

and finally, moved into a chamber containing a lysis buffer such as guanidine isothyocyanate. The 

leukocyte cell lysate is further processed for RNA isolation in the chip, or is then removed for further 

processing, for example, RNA extraction by standard methods. Alternatively, the microfluidics chip is 

a circular disk containing ficoll or another density reagent. The blood sample is injected into the center 

of the disc, the disc is rotated at a speed that generates a centrifugal force appropriate for density 

gradient separation of mononuclear cells, and the separated mononuclear cells are then harvested for 

further analysis or processing. 

It is understood that the methods of expression evaluation, above, although discussed in the 
context of discovery of diagnostic nucleotide sets, are also applicable for expression evaluation when 
using diagnostic nucleotide sets for, e.g. diagnosis of diseases, as further discussed below. 
Evaluation of expression patterns 

Expression patterns can be evaluated by qualitative and/or quantitative measures. Certain of 
the above described techniques for evaluating gene expression (as RNA or protein products) yield data 
that are predominantly qualitative in nature. That is, the methods detect differences in expression that 
classify expression into distinct modes without providing significant information regarding quantitative 
aspects of expression. For example, a technique can be described as a qualitative technique if it detects 
the presence or absence of expression of an candidate nucleotide sequence, i.e., an on/ofT pattern of 
expression. Alternatively, a qualitative technique measures the presence (and/or absence) of different 
alleles, or variants, of a gene product. 

In contrast, some methods provide data that characterizes expression in a quantitative manner. 
That is, the methods relate expression on a numerical scale, e.g., a scale of 0-5, a scale of 1-10, a scale 
of + - +++, from grade 1 to grade 5, a grade from a to z, or the like. It will be understood that the 
numerical, and symbolic examples provided are arbitrary, and that any graduated scale (or any 
symbolic representation of a graduated scale) can be employed in the context of the present invention 
to describe quantitative differences in nucleotide sequence expression. Typically, such methods yield 
information corresponding to a relative increase or decrease in expression. 

Any method that yields either quantitative or qualitative expression data is suitable for 
evaluating expression of candidate nucleotide sequence in a subject sample of leukocytes. In some 
cases, e.g., when multiple methods are employed to determine expression patterns for a plurality of 
candidate nucleotide sequences, the recovered data, e.g., the expression profile, for the nucleotide 
. sequences is a combination of quantitative and qualitative data. 
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In some applications, expression of the plurality of candidate nucleotide sequences is 
evaluated sequentially. This is typically the case for methods that can be characterized as low- to 
moderate-throughput. In contrast, as the throughput of the elected assay increases, expression for the 
plurality of candidate nucleotide sequences in a sample or multiple samples of leukocytes, is assayed 
simultaneously. Again, the methods (and throughput) are largely determined by the individual 
practitioner, although, typically, it is preferable to employ methods that permit rapid, e.g. automated or 
partially automated, preparation and detection, on a scale that is time-efficient and cost-effective. 

It is understood that the preceding discussion, while directed at the assessment of expression 
of the members of candidate libraries, is also applies to the assessment of the expression of members of 
diagnostic nucleotide sets, as further discussed below. 
Genotvping 

In addition to, or in conjunction with the correlation of expression profiles and clinical data, it 
is often desirable to correlate expression patterns with the subject's genotype at one or more genetic 
loci. The selected loci can be, for example, chromosomal loci corresponding to one or more member of 
the candidate library, polymorphic alleles for marker loci, or alternative disease related loci (not 
contributing to the candidate library) known to be, or putatively associated with, a disease (or disease 
criterion). Indeed, it will be appreciated, that where a (polymorphic) allele at a locus is linked to a 
disease (or to a predisposition to a disease), the presence of the allele can itself be a disease criterion. 

Numerous well known methods exist for evaluating the genotype of an individual, including 
southern analysis, restriction fragment length polymorphism (RFLP) analysis, polymerase chain 
reaction (PCR), amplification length polymorphism (AFLP) analysis, single stranded conformation 
polymorphism (SSCP) analysis, single nucleotide polymorphism (SNP) analysis (e.g., via PCR, 
Taqman or molecular beacons), among many other useful methods. Many such procedures are readily 
adaptable to high throughput and/or automated (or semi-automated) sample preparation and analysis 
methods. Most, can be performed on nucleic acid samples recovered via simple procedures from the 
same sample of leukocytes as yielded the material for expression profiling. Exemplary techniques are 
described in, e.g., Sambrook, and Ausubel, supra. 

Identification of the diagnostic nucleotide sets of the invention 

Identification of diagnostic nucleotide sets and disease specific target nucleotide sequence 
proceeds by correlating the leukocyte expression profiles with data regarding the subject's health status 
to produce a data set designated a "molecular signature." Examples of data regarding a patient's health 
status, also termed "disease criteria(ion)", is described below and in the Section titled "selected 
diseases," below. Methods useful for correlation analysis are further described elsewhere in the 
specification. 

Generally, relevant data regarding the subject's health status includes retrospective or 
prospective health data, e.g., in the form of the subject's medical histpry^as provided by the subject, 
physician or third party, such as, medical diagnoses, laboratory test results, diagnostic test results, 
clinical events, or medication lists, as further described below. Such data may include information 
regarding a patient's response to treatment and/or a particular medication and data regarding the 
presence of previously characterized "risk factors." For example, cigarette smoking and obesity are 
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previously identified risk factors for heart disease. Further examples of health status information, 
including diseases and disease criteria, is described in the section titled Selected diseases, below. 

Typically, the data describes prior events and evaluations (i.e., retrospective data). However, it 
is envisioned that data collected subsequent to the sampling (i.e., prospective data) can also be 
correlated with the expression profile. The tissue sampled, e.g., peripheral blood, bronchial lavage, 
etc., can be obtained at one or more multiple time points and subject data is considered retrospective or 
prospective with respect to the time of sample procurement. 

Data collected at multiple time points, called "longitudinal data", is often useful, and thus, the 
invention encompasses the analysis of patient data collected from the same patient at different time 
points. Analysis of paired samples, such as samples from a patient at different time, allows 
identification of differences that are specifically related to the disease state since the genetic variability 
specific to the patient is controlled for by the comparison. Additionally, other variables that exist 
between patients may be controlled for in this way, for example, the presence or absence of 
inflammatory diseases (e.g., rheumatoid arthritis) the use of medications that may effect leukocyte gene 
expression, the presence or absence of co-morbid conditions, etc. Methods for analysis of paired 
samples are further described below. Moreover, the analysis of a pattern of expression profiles 
(generated by collecting multiple expression profiles) provides information relating to changes in 
expression level over time, and may permit the determination of a rate of change, a trajectory, or an 
expression curve. Two longitudinal samples may provide information on the change in expression of a 
gene over time, while three longitudinal samples may be necessary to determine the "trajectory" of 
expression of a gene. Such information may be relevant to the diagnosis of a disease. For example, the 
expression of a gene may vary from individual to individual, but a clinical event, for example , a heart 
attack, may cause the level of expression to double in each patient. In this example, clinically 
interesting information is gleaned from the change in expression level, as opposed to the absolute level 
of expression in each individual. 

When a single patient sample is obtained, it may still be desirable to compare the expression 
profile of that sample to some reference expression profile. In this case, one can determine the change 
of expression between the patient's sample and a reference expression profile that is appropriate for 
that patient and the medical condition in question. For example, a reference expression profile can be 
determined for all patients without the disease criterion in question who have similar characteristics, 
such as age, sex, race, diagnoses etc. 

Generally, small sample sizes of 20-100 samples are used to identify a diagnostic nucleotide 
set. Larger sample sizes are generally necessary to validate the diagnostic nucleotide set for use in 
large and varied patient populations, as further described below. For example, extension of gene 
expression correlations to varied ethnic groups, demographic groups, nations, peoples or races may 
require expression correlation experiments on the population of interest. 
Expression Reference Standards 

Expression profiles derived from a patient (i.e., subjects diagnosed with, or exhibiting 
symptoms of, or exhibiting a disease criterion, or under a doctor's care for a disease) sample are 
compared to a control or standard expression RNA to facilitate comparison of expression profiles (e.g. 
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of a set of candidate nucleotide sequences) from a group of patients relative to each other (i.e., from 
one patient in the group to other patients in the group, or to patients in another group). 

The reference RNA used should have desirable features of low cost and simplicity of 
production on a large scale. Additionally, the reference RNA should contain measurable amounts of as 
many of the genes of the candidate library as possible. 

For example, in one approach to identifying diagnostic nucleotide sets, expression profiles 
derived from patient samples are compared to a expression reference "standard." Standard expression 
reference can be, for example, RNA derived from resting cultured leukocytes or commercially 
available reference RNA, such as Universal reference RNA from Stratagene. See Nature, V406, 8-17- 
00, p. 747-752. Use of an expression reference standard is particularly useful when the expression of 
large numbers of nucleotide sequences is assayed, e.g. in an array, and in certain other applications, e.g. 
qualitative PCR, RT-PCR, etc., where it is desirable to compare a sample profile to a standard profile, 
and/or when large numbers of expression profiles, e.g. a patient population, are to be compared. 
Generally, an expression reference standard should be available in large quantities, should be a good 
substrate for amplification and labeling reactions, and should be capable of detecting a large percentage 
of candidate nucleic acids using suitable expression profiling technology. 

Alternatively, or in addition, the expression profile derived from a patient sample is compared 
with the expression of an internal reference control gene, for example, p-actin or CD4. The relative 
expression of the profiled genes and the internal reference control gene (from the same individual) is 
obtained. An internal reference control may also be used with a reference RNA. For example, an 
expression profile for "gene 1" and the gene encoding CD4 can be detennined in a patient sample and 
in a reference RNA. The expression of each gene can be expressed as the "relative" ratio of 
expression the gene in the patient sample compared with expression of the gene in the reference RNA. 
The expression ratio (sample/reference) for gene 1 may be divided by the expression ration for CD4 
(sample/reference) and thus the relative expression of gene 1 to CD4 is obtained. 

The invention also provides a buffy coat control RNA useful for expression profiling, and a 
method of using control RNA produced from a population of buffy coat cells, the white blood cell layer 
derived from the centrifugation of whole blood. Buffy coat contains all white blood cells, including 
granulocytes, mononuclear cells and platelets. The invention also provides a method of preparing 
control RNA from buffy coat cells for use in expression profile analysis of leukocytes. Buffy coat 
fractions are obtained, e.g. from a blood bank or directly from individuals, preferably from a large 
number of individuals such that bias from individual samples is avoided and so that the RNA sample 
represents an average expression of a healthy population. Buffy coat fractions from about 50 or about 
1 00, or more individuals are preferred. 10 ml buffy coat from each individual is used. Buffy coat 
samples are treated with an erythrocyte lysis buffer, so that erythrocytes are selectively removed. 
_ The leukocytes of the Vuffy coat layer are collected by centrifugation. Alternatively, the buffy cell 
sample can be further enriched for a particular leukocyte sub-populations, e.g. mononuclear cells, T- 
lymphocytes, etc." To enrich for mononuclear cells, the buffy cell pellet, above, is diluted in PBS 
(phosphate buffered saline) and loaded onto a non-polystyrene tube containing a polysucrose and 
sodium diatrizoate solution adjusted to a density of 1 .077+/-0.001 g/ml. To enrich for T-lymphocytes, 
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45 ml of whole blood is treated with RosetteSep (Stem Cell Technologies), and incubated at room 
temperature for 20 minutes. The mixture is diluted with an equal volume of PBS plus 2% FBS and 
mixed by inversion. 30 ml of diluted mixture is layered on top of 15 ml DML medium (Stem Cell 
Technologies). The tube is centrifuged at 1200 x g, and the enriched cell layer at the plasma : medium 
interface is removed, washed with PBS + 2% FBS, and cells collected by centrifugation at 1200 x g. 
The cell pellet is treated with 5 ml of erythrocyte lysis buffer (EL buffer, Qiagen) for 10 minutes on 
ice, and enriched T-lymphoctes are collected by centrifugation. 

In addition or alternatively, the buffy cells (whole buffy coat or sub-population, e.g. 
mononuclear fraction) can be cultured in vitro and subjected to stimulation with cytokines or activating 
chemicals such as phorbol esters or ionomycin. Such stimuli may increase expression of nucleotide 
sequences that are expressed in activated immune cells and might be of interest for leukocyte 
expression profiling experiments. 

Following sub-population selection and/or further treatment, e.g. stimulation as described 
above, RNA is prepared using standard methods. For example, cells are pelleted and lysed with a 
phenol/guanidinium thiocyanate and RNA is prepared. RNA can also be isolated using a silica gel- 
based purification column or the column method can be used on RNA isolated by the 
phenol/guanidinium thiocyanate method. RNA from individual buffy coat samples can be pooled 
during this process, so that the resulting reference RNA represents the RNA of many individuals and 
individual bias is imnimized or eliminated. In addition, a new batch of buffy coat reference RNA can 
be directly compared to the last batch to ensure similar expression pattern from one batch to another, 
using methods of collecting and comparing expression profiles described above/below. One or more 
expression reference controls are used in an experiment. For example, RNA derived from one or more 
of the following sources can be used as controls for an experiment: stimulated or unstimulated whole 
buffy coat, stimulated or unstimulated peripheral mononuclear cells, or stimulated or unstimulated T- 
lymphocytes. 

Alternatively, the expression reference standard can be derived from any subject or class of 
subjects including healthy subjects or subjects diagnosed with the same or a different disease or disease 
criterion. Expression profiles from subjects in two distinct classes are compared to determine which 
subset of nucleotide sequences in the candidate library best distinguish between the two subject classes, 
as further discussed below. It will be appreciated that in the present context, the term "distinct classes" 
is relevant to at least one distinguishable criterion relevant to a disease of interest, a "disease criterion." 
The classes can, of course, demonstrate significant overlap (or identity) with respect to other disease 
criteria, or with respect to disease diagnoses, prognoses, or the like. The mode of discovery involves, 
e.g., comparing the molecular signature of different subject classes to each other (such as patient to 
control, patients with a first diagnosis to patients with a second diagnosis, etc.) or by comparing the 
molecular signatures of a single individual taken at different time points. The invention can be applied 
to a broad range of diseases, disease criteria, conditions and other clinical and/or epidemiological 
questions, as further discussed above/below. 

It is appreciated that while the present discussion pertains to the use of expression reference 
controls while identifying diagnostic nucleotide sets, expression reference controls are also useful 
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during use of diagnostic nucleotide sets, e.g. use of a diagnostic nucleotide set for diagnosis of a 
disease, as further described below. 

Analysis of expression profiles 

In order to facilitate ready access, e.g., for comparison, review, recovery, and/or modification, 
the molecular signatures/expression profiles are typically recorded in a database. Most typically, the 
database is a relational database accessible by a computational device, although other formats, e.g., 
manually accessible indexed files of expression profiles as photographs, analogue or digital imaging 
readouts, spreadsheets, etc. can be used. Further details regarding preferred embodiments are provided 
below. Regardless of whether the expression patterns initially recorded are analog or digital in nature 
and/or whether they represent quantitative or qualitative differences in expression, the expression 
patterns, expression profiles (collective expression patterns), and molecular signatures (correlated 
expression patterns) are stored digitally and accessed via a database. Typically, the database is 
compiled and maintained at a central facility, with access being available locally and/or remotely. 

As additional samples are obtained, and their expression profiles determined and correlated 
with relevant subject data, the ensuing molecular signatures are likewise recorded in the database. 
However, rather than each subsequent addition being added in an essentially passive manner in which 
the data from one sample has little relation to data from a second (prior or subsequent) sample, the 
algorithms optionally additionally query additional samples against the existing database to further 
refine the association between a molecular signature and disease criterion. Furthermore, the data set 
comprising the one (or more) molecular signatures is optionally queried against an expanding set of 
additional or other disease criteria. The use of the database in integrated systems and web 
embodiments is further described below. 

Analysis of expression profile data from arrays 

Expression data is analyzed using methods well known in the art, including the software 
packages Imagene (Biodiscovery, Marina del Rey, CA), Feature Extraction Software (Agilent, Palo 
Alto, CA), and Scanalyze (Stanford University). In the discussion that follows, a "feature" refers to an 
individual spot of DNA on an array. Each gene may be represented by more than one feature. For 
example, hybridized microarrays are scanned and analyzed on an Axon Instruments scanner using 
GenePix 3.0 software (Axon Instruments, Union City, CA). The data extracted by GenePix is used for 
all downstream quality control and expression evaluation. The data is derived as follows. The data for 
all features flagged as "not found" by the software is removed from the dataset for individual 
hybridizations. The "not found" flag by GenePix indicates that the software was unable to discriminate 
the feature from the background. Each feature is examined to determine the value of its signal. The 
median pixel intensity of the background (B n ) is subtracted from the median pixel intensity of the 
feature (F n ) to produce the background-subtracted signal (hereinafter, "BGSS"). The BGSS is divided 
by the standard deviation of the background pixels to provide the signal-to-noise ratio (hereinafter, 
"S/N"). Features with a S/N of three or greater in both the Cy3 channel (corresponding to the sample 
RNA) and Cy5 channel (corresponding to the reference RNA) are used for further analysis (hereinafter 
denoted '^useable features"). Alternatively, different S/Ns are used for selecting expression data for an 
analysis. For example, only expression data with signal to noise ratios > 3 might be used in an 
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analysis. Alternatively, features with S/N values < 3 may be flagged as such and included in the 
analysis. Such flagged data sets include more values and may allow one to discover expression 
markers that would be missed otherwise. However, such data sets may have a higher variability than 
filtered data, which may decrease significance of findings or performance of correlation statistics. 

For each usable feature (i), the expression level (e) is expressed as the logarithm of the ratio 
(R) of the Background Subtracted Signal (hereinafter "BGSS") for the Cy3 (sample RNA) channel 
divided by the BGSS for the Cy5 channel (reference RNA). This "log ratio" value is used for 
comparison to other experiments. 



Ri = BGSS -+ (0.1) 
BGSS reference 



e, = log>; (0.2) 

Variation in signal across hybridizations may be caused by a number of factors affecting 
hybridization, DNA spotting, wash conditions, and labeling efficiency. 

A single reference RNA may be used with all of the experimental RNAs, permitting multiple 
comparisons in addition to individual comparisons. By comparing sample RNAs to the same 
reference, the gene expression levels from each sample are compared across arrays, permitting the use 
of a consistent denominator for our experimental ratios. 

Alternative methods of analyzing the data may involve 1) using the sample channel without 
normalization by the reference channel, 2) using an intensity-dependent normalization based on the 
reference which provides a greater correction when the signal in the reference channel is large, 3) using 
the data without background subtraction or subtracting an empirically derived function of the 
background intensity rather than the background itself. 
Scaling 

The data may be scaled (normalized) to control for labeling and hybridization variability 
within the experiment, using methods known in the art. Scaling is desirable because it facilitates the 
comparison of data between different experiments, patients, etc. Generally the BGSS are scaled to a 
factor such as the median, the mean, the trimmed mean, and percentile. Additional methods of scaling 
include: to scale between 0 and 1, to subtract the mean, or to subtract the median. 

Scaling is also performed by comparison to expression patterns obtained using a common 
reference RNA, as described in greater detail above. As with other scaling methods, the reference 
RNA facilitates multiple comparisons of the expression data, e.g., between patients, between samples, 
etc. Use of a reference RNA provides a consistent denominator for experimental ratios. 

In addition to the use of a reference RNA, individual expression levels may be adjusted to 
correct for differences in labeling efficiency between different hybridization experiments, allowing 
direct comparison between experiments with different overall signal intensities, for example. A scaling 
factor (a) may be used to adjust individual expression levels as follows. The median of the scaling 
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factor (a), for example, BGSS, is determined for the set of all features with a S/N greater than three. 
Next, the BGSSj (the BGSS for each feature "i") is divided by the median for all features (a), 
generating a scaled ratio. The scaled ration is used to determine the expression value for the feature 
(e,), or the log ratio. 



e ( . = log 



Cy3S i 



(0.4) 



CySS, 

In addition, or alternatively, control features are used to normalize the data for labeling and 
hybridization variability within the experiment. Control feature may be cDNA for genes from the 
plant, Arabidopsis th a liana, that are included when spotting the mini-array. Equal amounts of RNA 
complementary to control cDNAs are added to each of the samples before they were labeled. Using the 
signal from these control genes, a normalization constant (L) is determined according to the following 
formula: 



i=i 



N 



k H BGSS u 



7=1 



N 



where BGSSj is the signal for a specific feature, N is the number of A. thaliana control features, K is 
the number of hybridizations, and Lj is the normalization constant for each individual hybridization. 

Using the formula above, the mean for all control features of a particular hybridization and 
dye (e.g., Cy3) is calculated. The control feature means for all Cy3 hybridizations are averaged, and 
the control feature mean in one hybridization divided by the average of all hybridizations to generate a 
normalization constant for that particular Cy3 hybridization (Lj) 9 which is used as a in equation (0.3). 
The same normalization steps may be performed for Cy3 and Cy5 values. 

An alternative scaling method can also be used. The log of the ratio of Green/Red is 
determined for all features. The median log ratio value for all features is determined. The feature 
values are then scaled using the following formula: Log_Scaled_Feature_Ratio = Log u _Feature_Ratio - 
Median_Log_Ratio. 

Many additional methods for normalization exist and can be applied to the data. In one 
method, the average ratio of Cy3 BGSS / Cy5 BGSS is determined for all features on an array. This 
ratio is then scaled to some arbitrary number, such as 1 or some other number. The ratio for each probe 
is then multiplied by the scaling factor required to bring the average ratio to the chosen level. This is 
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performed for each array in an analysis. Alternatively, the ratios are normalized to the average ratio 
across all. arrays in an analysis. Other methods of normalization include forcing the distribution of 
signal strengths of the various arrays into greater agreement by transforming them to match certain 
points (quartiles, or deciles, etc.) in a standard distribution, or in the most extreme case using the rank 
of the signal of each oligonucleotide relative to the other oligonucleotides on the array. 

If multiple features are used per gene sequence or oligonucleotide, these repeats can be used to 
derive an average expression value for each gene. If some of the replicate features are of poor qualitay 
and don't meet requirements for analysis, the remaining features can be used to represent the gene or 
gene sequence. 

Correlation analysis 

Correlation analysis is performed to determine which array probes have expression behavior 
that best distinguishes or serves as markers for relevant groups of samples representing a particular 
clinical condition. Correlation analysis, or comparison among samples representing different disease 
criteria (e.g., clinical conditions), is performed using standard statistical methods. Numerous 
algorithms are useful for correlation analysis of expression data, and the selection of algorithms 
depends in part on the data analysis to be performed. For example, algorithms can be used to identify 
the single most informative gene with expression behavior that reliably classifies samples, or to 
identify all the genes useful to classify samples. Alternatively, algorithms can be applied that 
determine which set of 2 or more genes have collective expression behavior that accurately classifies 
samples. The use of multiple expression markers for diagnostics may overcome the variability in 
expression of a gene between individuals, or overcome the variability intrinsic to the assay. Multiple 
expression markers may include redundant markers (surrogates), in that two or more genes or probes 
may provide the same information with respect to diagnosis. This may occur, for example, when two 
or more genes or gene probes are coordinately expressed. For diagnostic application, it may be 
appropriate to utilize a gene and one or more of its surrogates in the assay. This redundancy may 
overcome failures (technical or biological) of a single marker to distinguish samples. Alternatively, 
one or more surrogates may have properties that make them more suitable for assay development, such 
as a higher baseline level of expression, better cell specificity, a higher fold change between sample 
groups or more specific sequence for the design of PCR primers or complimentary probes. It will be 
appreciated that while the discussion above pertains to the analysis of RNA expression profiles the 
discussion is equally applicable to the analysis of profiles of proteins or other molecular markers. 

Prior to analysis, expression profile data may be formatted or prepared for analysis using 
methods known in the art. For example, often the log ratio of scaled expression data for every array 
probe is calculated using the following formula: 

l°g (Cy 3 BOSS/ Cy5 BGSS), where Cy 3 signal corresponds to the expression of the gene in 
the clinical sample, and Cy5 signal corresponds to expression of the gene in the reference RNA. 

Data may be further filtered depending on the specific analysis to be done as noted below. For 
example, filtering may be aimed at selecting only samples with expression above a certain level, or 
. probes with variability above a certain level between sample sets. 
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The following non-limiting discussion consider several statistical methods known in the art. 
Briefly, the t-test and ANOVA are used to identify single genes with expression differences between or 
among populations, respectively. Multivariate methods are used to identify a set of two or more genes 
for which expression discriminates between two disease states more specifically than expression of any 
single gene. 

t-test 

The simpjest measure of a difference between two groups is the Student's t test. See, e.g., 
Welsh et al. (2001) Proc Natl Acad Sci USA 98:1 176-81 (demonstrating the use of an unpaired 
Student's t-test for the discovery of differential gene expression in ovarian cancer samples and control 
tissue samples). The t- test assumes equal variance and normally distributed data. This test identifies 
the probability that there is a difference in expression of a single gene between two groups of samples. 
The number of samples within each group that is required to achieve statistical significance is 
dependent upon the variation among the samples within each group. The standard formula for a t-test 
is: 



'(*,) = 



V(^Ac)+(4A,)' 



(0.5) 



where e, is the difference between the mean expression level of gene i in groups c and t, s ix is the 
variance of gene x in group c and s it , is the variance of gene x in group t. n c and n t are the numbers of 
samples in groups c and t. 

The combination of the t statistic and the degrees of freedom [min(n,, n c )-\] provides a p 
value, the probability of rejecting the null hypothesis. A p-value of £0.01, signifying a 99 percent 
probability the mean expression levels are different between the two groups (a 1% chance that the 
mean expression levels are in fact not different and that the observed difference occurred by statistical 
chance), is often considered acceptable. 

When performing tests on a large scale, for example, on a large dataset of about 8000 genes, a 
correction factor must be included to adjust for the number of individual tests being performed. The 
most common and simplest correction is the Bonferroni correction for multiple tests, which divides the 
p-value by the number of tests run. Using this test on an 8000 member dataset indicates that a p value 
of <0.00000125 is required to identify genes that are likely to be truly different between the two test 
conditions. 

Significance analysis for microarravs (SAM) 
Significance analysis for microarrays (SAM) (f usher 2001) is a method through which genes 
with a correlation between their expression values and the response vector are statistically discovered 
and assigned a statistical significance. The ratio of false significant to significant genes is the False 
Discovery Rate (FDR). This means that for each threshold there are a set of genes which are called 
significant, and the FDR gives a confidence level for this claim. If a gene is called differentially 
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expressed between 2 classes by SAM, with a FDR of 5%, there is a 95% chance that the gene is 
actually differentially expressed between the classes. SAM takes intoaccount the variability and large 
number of variables of microarrays. SAM will identiy genes that are most globally differentially 
expressed between the classes. Thus, important genes for identifying and classifying outlier samples or 
patients may not be identified by SAM. 
Non- Parametric Tests 

Wilcoxon's signed ranks method is one example of a non-parametric test and is utilized for 
paired comparisons. See e.g., Sokal and Rohlf (1987) Introduction to Biostatistics 2 nd edition, WH 
Freeman, New York. At least 6 pairs are necessary to apply this statistic. This test is useful for 
analysis of paired expression data (for example, a set of patients who have cardiac transplant biopsy on 
2 occasions and have a grade 0 on one occasion and a grade 3A on another). The Fisher Exact Test 
with a threshold and the Mann- Whitney Test are other non-parametric tests that may be used. 
ANQVA 

Differences in gene expression across multiple related groups may be assessed using an 
Analysis of Variance (ANOVA), a method well known in the art (Michelson and Schofield, 1996). 
Multivariate analysis 

Many algorithms suitable for multivariate analysis are known in the art. Generally, a set of 
two or more genes for which expression discriminates between two disease states more specifically 
than expression of any single gene is identified by searching through the possible combinations of 
genes using a criterion for discrimination, for example the expression of gene X must increase from 
normal 300 percent, while the expression of genes Y and Z must decrease from normal by 75 percent. 
Ordinarily, the search starts with a single gene, then adds the next best fit at each step of the search. 
Alternatively, the search starts with all of the genes and genes that do not aid in the discrimination are 
eliminated step-wise. 

Paired samples 

Paired samples, or samples collected at different time-points from the same patient, are often 
useful, as described above. For example, use of paired samples permits the reduction of variation due 
to genetic variation among individuals. In addition, the use of paired samples has a statistical 
significance, in that data derived from paired samples can be calculated in a different manner that 
recognizes the reduced variability. For example, the formula for a t-test for paired samples is: 



t(e x ) = °** = , (0.5) 

where D is the difference between each set of paired samples and b is "the number of sample pairs. 
D is the mean of the differences between the members of the pairs. In this test, only the differences 
between the paired samples are considered, then grouped together (as opposed to taking all possible 
differences between groups, as would be the case with an ordinary t-test): Additional statistical tests 
useful with paired data, e.g., ANOVA and Wilcoxon's signed rank test, are discussed above. 
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Diagnostic classification 

Once a discriminating set of genes is identified, the diagnostic classifier (a mathematical 
function that assigns samples to diagnostic categories based on expression data) is applied to unknown 
sample expression levels. 

Methods that can be used for this analysis include the following non-limiting list: 

CLEAVER is an algorithm used for classification of useful expression profile data. See 
Raychaudhuri et al. (2001) Trends Biotechnol 19:189-193. CLEAVER uses positive training samples 
(e.g., expression profiles from samples known to be derived from a particular patient or sample 
diagnostic category, disease or disease criteria), negative training samples (e.g., expression profiles 
from samples known not to be derived from a particular patient or sample diagnostic category, disease 
or disease criteria) and test samples (e.g., expression profiles obtained from a patient), and determines 
whether the test sample correlates with the particular disease or disease criteria, or does not correlate 
with a particular disease or disease criteria. CLEAVER also generates a list of the 20 most predictive 
genes for classification. 

Artificial neural networks (hereinafter, "ANN") can be used to recognize patterns in complex 
data sets and can discover expression criteria that classify samples into more than 2 groups. The use of 
artificial neural networks for discovery of gene expression diagnostics for cancers using expression 
data generated by oligonucleotide expression microarrays is demonstrated by Khan et al. (2001) Nature 
Med. 7:673-9. Khan found that 96 genes provided 0% error rate in classification of the tumors. The 
most important of these genes for classification was then determined by measuring the sensitivity of the 
classification to a change in expression of each gene. Hierarchical clustering using the 96 genes results 
in correct grouping of the cancers into diagnostic categories. 

Golub uses cDNA microarrays and a distinction calculation to identify genes with expression 
behavior that distinguishes myeloid and lymphoid leukemias. See Golub et al. (1999) Science 
286:531-7. Self organizing maps were used for new class discovery. Cross validation was done with a 
"leave one out" analysis. 50 genes were identified as useful markers. This was reduced to as few as 10 
genes with equivalent diagnostic accuracy. 

Hierarchical and non-hierarchical clustering methods are also useful for identifying groups of 
genes that correlate with a subset of clinical samples such as with transplant rejection grade. Alizadeh 
used hierarchical clustering as the primary tool to distinguish different types of diffuse B-cell 
lymphomas based on gene expression profile data. See Alizadeh et al. (2000) Nature 403:503-1 1. 
Alizadeh used hierarchical clustering as the primary tool to distinguish different types of diffuse B-cell 
lymphomas based on gene expression profile data. A cDNA array carrying 17856 probes was used for 
these experiments, 96 samples were assessed on 128 arrays, and a set of 380 genes was identified as 
being useful for sample classification. 

Perou demonstrates the use of hierarchical clustering for the molecular classification of breast 
tumor samples based on expression profile data. See Perou el al. (2000) Nature 406:747-52. In this 
work, a cDNA array carrying 8102 gene probes was used. 1753 of these genes were found to have 
high variation between breast tumors and were used for the analysis. 
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Hastie describes the use of gene shaving for discovery of expression markers. Hastie et al. 
(2000) Genome Biol. 1(2):RESEARCH 0003.1 -0003.2 1. The gene shaving algorithm identifies sets of 
genes with similar or coherent expression patterns, but large variation across conditions (RNA samples, 
sample classes,' patient classes)*. In this manner, genes with a tight expression pattern within a 
transplant rejection grade, but also with high variability across rejection grades are grouped together. 
The algorithm takes advantage of both characteristics in one grouping step. For example, gene shaving 
can identify useful marker genes with co-regulated expression. Sets of useful marker genes can be 
reduced to a smaller set, with each gene providing some non-redundant value in classification. This 
algorithm was used on the data set described in Alizadeh et al., supra, and the set of 380 informative 
gene markers was reduced to 234. 

Supervised harvesting of expression trees (Hastie 2001) identifies genes or clusters that best 
distinguish one class from all the others on the data set The method is used to identify the 
genes/clusters that can best separate one class versus all the others for datasets that include two or more 
classes or all classes from each other. This algorithm can be used for discovery or testing of a 
diagnostic gene set. 

CART is a decision tree classification algorithm (Breiman 1984). From gene expression and 
or other data, CART can develop a decision tree for the classification of samples. Each node on the 
decision tree involves a query about the expression level of one or more genes or variables. Samples 
that are above the threshold go down one branch of the decision tree and samples that are not go down 
the other branch. See example 4 for further description of its use in classification analysis and 
examples of its usefulness in discovering and implementing a diagnostic gene set. CART identifies 
surrogates for each splitter (genes that are the next best substitute for a useful gene in classification. 

Multiple Additive Regression Trees (Friedman, JH 1999, MART) is similar to CART in that it 
is a classification algorithm that builds decision trees to distinguish groups. MART builds numerous 
trees for any classification problem and the resulting model involves a combination of the multipte 
trees. MART can select variables as it build models and thus can be used on large data sets, such as 
those derived from an 8000 gene microarray. Because MART uses a combination of many trees and 
does not take too much information from any one tree, it resists over training. MART identifies a set of 
genes and an algorithm for their use as a classifier. 

A Nearest Shrunken Centroids Classifier can be applied to microarray or other data sets by the 
methods described by Tibshirani et al. 2002. This algorithms also identified gene sets for classification 
and determines their 10 fold cross validation error rates for each class of samples. The algorithm 
determines the error rates for models of any size, from one gene to all genes in the set. The error rates 
for either or both sample classes can are rninimized when a particular number of genes are used. When 
this gene number is determined, the algorithm associated with the selected genes can be identified and 
employed as a. classifier on prospective sample. 

Once a set of genes and expression criteria for those genes have been established for 
classification, cross validation is done. There are many approaches, including a 10 fold cross 
validation analysis in which 10% of the training samples are left out of the analysis and the 
classification algorithm is built with the remaining 90%. The 10% are then used as a test set for the 
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algorithm. The process is repeated 10 times with 10% of the samples being left out as a test set each 
time. Through this analysis, one can derive a cross validation error which helps estimate the robustness 
of the algorithm for use on prospective (test) samples. 

Clinical data are gathered for every patient sample used for expression analysis. Clinical 
variables can be quantitative or non-quantitative. A clinical variable that is quantitiative can be used as 
a variable for significance or classification analysis. Non-quantitative clinical variables, such as the sex 
of the patient, can also be used in a significance analysis or classification analysis with some statistical 
tool. It is appreciated that the most useful diagnostic gene set for a condition may be optimal when 
considered along with one or more predictive clinical variables. Clinical data can also be used as 
supervising vectors for a correlation analysis. That is to say that the clinical data associated with each 
sample can be used to divide the samples into meaningful diagnostic categories for analysis. For 
example, samples can be divided into 2 or more groups based on the presence or absence of some 
diagnostic criterion (a). In addition, clinical data can be utilized to select patients for a correlation 
analysis or to exclude them based on some undesirable characteristic, such as an ongoing infection, a 
medicine or some other issue. Clincial data can also be used to assess the pre-test probability of an 
outcome. For example, patients who are female are much more likely to be diagnosed as having 
systemic lupus erythematosis than patients who are male. 

Once a set of genes are identified that classify samples with acceptable accuracy. These genes 
are validated as a set using new samples that were not used to discover the gene set. These samples can 
be taken from frozen archieves from the discovery clinical study or can be taken from new patients 
prospectively. Validation using a "test set" of samples can be done using expression profiling of the 
gene set with microarrays or using real-time PCR for each gene on the test set samples. Alternatively, 
a different expression profiling technology can be used. 
Immune Monitoring 

Leukocyte gene expression can be used to monitor the immune system. Immune monitoring 
examines both the level of gene expression for a set of genes in a given cell type and for genes which 
are expressed in a cell type selective manner gene expression monitoring will also detect the presence 
or absence of new cell types, progenitor cells, differentiation of cells and the like. Gene expression 
patterns may be associated with activation or the resting state of cells of the immune system that are 
responsible for or responsive to a disease state. For example, in the process of transplant rejection, 
cells of the immune system are activated by the presence of the foreign tissue. Genes and gene sets that 
monitor and diagnose this process are providing a measure of the level and type of activation of the 
immune system. Genes and gene sets that are useful in monitoring the immune system may be useful 
for diagnosis and monitoring of all diseases that involve the immune system. Some examples are 
transplant rejection, rheumatoid arthritis, lupus, inflammatory bowel diseases, multiple sclerosis, 
HIV/AIDS, and viral, bacterial and fungal infection. All disorders and diseases disclosed herein are 
contemplated. Genes and gene sets that monitor immune activation are useful for monitoring response 
to immunosuppressive drug therapy, which is used to decrease immune activation. Genes are found to 
correlate with immune activation by correlation of expression patterns to the known presence of 
immune activation or quiescence in a sample as determined by some other test. 
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Selected Diseases 

In principle, diagnostic nucleotide sets of the invention may be developed and applied to 
essentially any disease, or disease criterion, as long as at least one subset of nucleotide sequences is 
differentially expressed in samples derived from one or more individuals with a disease criteria or 
disease and one or more individuals without the disease criteria or disease^ wherein the individual may 
be the same individual sampled at different points in time, or the individuals may be different 
individuals (or populations of individuals). For example, the subset of nucleotide sequences may be 
differentially expressed in the sampled tissues of subjects with the disease or disease criterion (e.g., a 
patient with a disease or disease criteria) as compared to subjects without the disease or disease 
criterion (e.g., patients without a disease (control patients)). Alternatively, or in addition, the subset of 
nucleotide sequence(s) may be differentially expressed in different samples taken from the same 
patient, e.g at different points in time, at different disease stages, before and after a treatment, in the 
presence or absence of a risk factor, etc. 

Expression profiles corresponding to sets of nucleotide sequences that correlate not with a 
diagnosis, but rather with a particular aspect of a disease can also be used to identify the diagnostic 
nucleotide sets and disease specific target nucleotide sequences of the invention. For example, such an 
aspect, or disease criterion, can relate to a subject's medical or family history, e.g., childhood illness, 
cause of death of a parent or other relative, prior surgery or other intervention, medications, symptoms 
(including onset and/or duration of symptoms), etc. Alternatively, the disease criterion can relate to a 
diagnosis, e.g., hypertension, diabetes, atherosclerosis, or prognosis (e.g., prediction of future 
diagnoses, events or complications), e.g., acute myocardial infarction, restenosis following angioplasty, 
reperfusion injury, allograft rejection, rheumatoid arthritis or systemic lupus erythematosis disease 
activity or the like. In other cases, the disease criterion corresponds to a therapeutic outcome, e.g., 
transplant rejection, bypass surgery or response to a medication, restenosis after stent implantation, 
collateral vessel growth due to therapeutic angiogenesis therapy, decreased angina due to 
revascularization, resolution of symptoms associated with a myriad of therapies, and the like. 
Alternatively, the disease criteria corresponds with previously identified or classic risk factors and may 
correspond to prognosis or future disease diagnosis. As indicated above, a disease criterion can also 
correspond to genotype for one or more loci. Disease criteria (including patient data) may be collected 
(and compared) from the same patient at different points in time, from different patients, between 
patients with a disease (criterion) and patients respresenting a control population, etc. Longitudinal 
data, i.e., data collected at different time points from an individual (or group of individuals) may be 
used for comparisons of samples obtained from an individual (group of individuals) at different points 
in time, to permit identification of differences specifically related to the disease state, and to obtain 
information relating to the change in expression over time, including a rate of change or trajectory of 
expression over time. The usefulness of longitudinal data is further discussed in the section titled 
"Identification of diagnostic nucleotide sets of the invention". 

It is further understood that diagnostic nucleotide sets may be developed for use in diagnosing 
conditions for which there is no present means of diagnosis. For example, in rheumatoid arthritis, joint 
destruction is often well under way before a patient experience symptoms of the condition. A 
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diagnostic nucleotide set may be developed that diagnoses rheumatic joint destruction at an earlier 
stage than would be possible using present means of diagnosis, which rely in part on the presentation of 
symptoms by a patient. Diagnostic nucleotide sets may also be developed to replace or augment 
current diagnostic procedures. For example, the use of a diagnostic nucleotide set to diagnose cardiac 
allograft rejection may replace the current diagnostic test, a graft biopsy. 

It is understood that the following discussion of diseases is exemplary and non-limiting, and 
further that the general criteria discussed above, e.g. use of family medical history, are generally 
applicable to the specific diseases discussed below. 

In addition to leukocytes, as described throughout, the general method is applicable to 
nucleotide sequences that are differentially expressed in any subject tissue or cell type, by the 
collection and assessment of samples of that tissue or cell type. However, in many cases, collection of 
such samples presents significant technical or medical problems given the current state of the art. 
Organ transplant refection and success 

A frequent complication of organ transplantation is recognition of the transplanted organ as 
foreign by the immune system resulting in rejection. Diagnostic nucleotide sets can be identified and 
validated for monitoring organ transplant success, rejection and treatment. Medications currently exist 
that suppress the immune system, and thereby decrease the rate of and severity of rejection. However, 
these drugs also suppress the physiologic immune responses, leaving the patient susceptible to a wide 
variety of opportunistic infections and cancers. At present there is no easy, reliable way to diagnose 
transplant rejection. Organ biopsy is the preferred method, but this is expensive, painful and associated 
with significant risk and has inadequate sensitivity for focal rejection. 

Diagnostic nucleotide sets of the present invention can be developed and validated for use as 
diagnostic tests for transplant rejection and success. It is appreciated that the methods of identifying 
diagnostic nucleotide sets are applicable to any organ transplant population. For example, diagnostic 
nucleotide sets are developed for cardiac allograft rejection and success. 

In some cases, disease criteria correspond to acute stage rejection diagnosis based on organ 
biopsy and graded using the International Society for Heart and Lung Transplantation ("ISHLT") 
criteria. This grading system classifies endomyocardial biopsies on the histological level as Grade 0, 
1 A, IB, 2, 3A, 3B, or 4. Grade 0 biopies have no evidence of rejection, while each successive grade 
has increased severity of leukocyte infiltration and/or damage to the graft myocardial cells. It is 
appreciated that there is variability in the Grading systems between medical centers and pathologists 
and between repeated readings of the same pathologist at different times. When using the biopsy grade 
as a disease criterion for leukocyte gene expression correlation analysis, it may be desirable to have a 
single pathologist read all biopsy slides or have multiple pathologists read all slides to determine the 
variability in this disease criterion. It is also appreciated that cardiac biopsy, in part due to variability, 
is not 100% sensitive or 100% specific for diagnosing acute rejection. When using the cardiac biopsy 
grade as a disease criterion for the discovery of diagnostic gene sets, it may be desirable to divide 
patient samples into diagnostic categories based on the grades. Examples of such classes are those 
patients with: Grade 0 vs. Grades 1 A-4, Grade 0 vs. Grades 1B-4, Grade 0 vs. Grades 2-4, Grade 0-1 
vs. Grade 2-4, Grade 0 : 1 vs. Grade 3 A-4, or Grade 0 vs. Grade 3 A-4. 
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Other disease criteria correspond to the cardiac biopsy results and other criteria, such as the 
results of cardiac function testing by echocardiography, hemodynamics assessment by cardiac 
catheterization, CMV infection, weeks post transplant, medication regimen, demographics and/or 
results of other diagnostic tests. 

Other disease criteria correspond to information from the patient's medical history and 
information regarding the organ donor. Alternatively, disease criteria include the presence or absence 
of cytomegalovirus (CMV) infection, Epstein-Barr virus (EBV) infection, allograft dysfunction 
measured by physiological tests of cardiac function (e.g., hemodynamic measurements from 
catheterization or echocardiograph data), and symptoms of other infections. Alternatively, disease 
criteria correspond to therapeutic outcome, e.g. graft failure, re-transplantation, death, hospitalization, 
need for intravenous immunosuppression, transplant vasculopathy, response to immunosuppressive 
medications, etc. Disease criteria may further correspond to a rejection episode of at least moderate 
histologic grade, which results in treatment of the patient with additional corticosteroids, anti-T cell 
antibodies, or total lymphoid irradiation; a rejection with histologic grade 2 or higher; a rejection with 
histologic grade <2; the absence of histologic rejection and normal or unchanged allograft function 
(based on hemodynamic measurements from catheterization or on echocardiographic data); the 
presence of severe allograft dysfunction or worsening allograft dysfunction during the study period 
(based on hemodynamic measurements from catheterization or on echocardiographic data).; 
documented CMV infection by culture, histology, or PCR, and at least one clinical sign or symptom of 
infection; specific graft biopsy rejection grades; rejection of mild to moderate histologic severity 
prompting augmentation of the patient's chronic immunosuppressive regimen; rejection of mild to 
moderate severity with allograft dysfunction prompting plasmapheresis or a diagnosis of "humoral" 
rejection; infections other than CMV, especially infection with Epstein Barr virus (EBV); 
lymphoproliferative disorder (also called post-transplant lymphoma); transplant vasculopathy 
diagnosed by increased intimal thickness on intravascular ultrasound (IVUS), angiography, or acute 
myocardial infarction; graft failure or retransplantation; and all cause mortality. Further specific 
examples of clinical data useful as disease criteria are provided in Example 3. 

In another example, diagnostic nucleotide sets are developed and validated for use in 
diagnosis and monitoring of kidney allograft recipients. Disease criteria correspond to, e.g., results of 
biopsy analysis for kidney allograft rejection, serum creatine level, creatinine clearance, radiological 
imaging results for the kidney and urinalysis results. Another disease criterion corresponds to the need 
for hemodialysis, retransplantation, death or other renal replacement therapy. Diagnostic nucleotide 
sets are developed and validated for use in diagnosis and treatment of bone marrow transplant and liver 
transplantation pateints, respectively. Disease criteria for bone marrow transplant correspond to the 
diagnosis and monitoring of graft rejection and/or graft versus host disease, the recurrence of cancer, 
complications due to immunosuppression, hematologic abnormalities, infection, hospitalization. and/or 
death. Disease criteria for liver transplant rejection include levels of serum markers for liver damage 
and liver function such as AST (aspartate aminotransferase), ALT (alanine aminotransferase), Alkaline 
phosphatase, GGT, (gamma-glutamyl transpeptidase) Bilirubin, Albumin and Prothrombin time. 
Further disease criteria correspond to hepatic encephalopathy, medication usage, ascites, graft failure, 
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retransplantation, hospitalization, complications of immunosuppression, results of diagnostic tests, 
results of radiological testing, death and histological rejection on graft biopsy. In addition, urine can be 
utilized for at the target tissue for profiling in renal transplant, while biliary and intestinal secretions 
and feces may be used favorably for hepatic or intestinal organ allograft rejection. Diagnostic 
nuclotide sets can also be discovered and developed for the diagnosis and monitoring of chronic renal 
allograft rejection . 

In the case of renal allografts, gene expression markers may be identified that are secreted 
proteins. These proteins may be detected in the urine of allograft recipients using standard 
immunoassays. Proteins are more likely to be present in the urine if they are of low molecular weight. 
Lower molecular weight proteins are more likely to pass through the glomerular membrane and into the 
urine. 

In another example, diagnostic nucleotide sets are developed and validated for use in 
diagnosis and treatment of xenograft recipients. This can include the transplantation of any organ from 
a non-human animal to a human or between non-human animals. Considerations for discovery and 
application of diagnostics and therapeutics and for disease criterion are substantially similar to those for 
allograft transplantation between humans. 

In another example, diagnostic nucleotide sets are developed and validated for use in 
diagnosis and treatment of artificial organ recipients. This includes, but is not limited to mechanical 
circulatory support, artificial hearts, left ventricular assist devices, renal replacement therapies, organ 
prostheses and the like. Disease criteria are thrombosis (blood clots), infection, death, hospitalization, 
and worsening measures of organ function (e.g., hemodynamics, creatinine, liver function testing, renal 
function testing, functional capacity). 

In another example, diagnostic nucleotide sets are developed and validated for use in matching 
donor organs to appropriate recipients. Diagnostic gene set can be discovered that correlate with 
successful matching of donor organ to recipient. Disease criteria include graft failure, acute and 
chronic rejection, death, hospitalization, immunosuppressive drug use, and complications of 
immunosuppression. Gene sets may be assayed from the donor or recipient's peripheral blood, organ 
tissue or some other tissue. 

In another example, diagnostic nucleotide sets are developed and validated for use in 
diagnosis and induction of patient immune tolerance (decrease rejection of an allograft by the host 
immune system). Disease criteria include rejection, assays of immune activation, need for 
immunosupression and all disease criteria noted above for transplantation of each organ. 
Viral diseases 

Diagnostic leukocyte nucleotide sets may be developed and validated for use in diagnosing 
viral disease, as well as diagnosing and monitoring transplant rejection. In another aspect, viral 
nucleotide sequences may be added to a leukocyte nucleotide set for use in diagnosis of viral diseases, 
as well as diagnosing and monitoring transplant rejection. Alternatively, viral nucleotide sets and 
leukocyte nucleotides sets may be used sequentially. 
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Epstein-Barr virus (EBV) 

EBV causes a variety of diseases such as mononucleosis, B-cell lymphoma, and pharyngeal 
carcinoma. It infects mononuclear cells and circulating atypical lymphocytes are a common 
manifestation of infection. Peripheral leukocyte gene expression is altered by infection. Transplant 
recipients and patients who are immunosuppressed are at increased risk for EBV-associated lymphoma. 

Diagnostic nucleotide sets may be developed and validated for use in diagnosis and 
monitoring of EBV, as well as diagnosing and monitoring transplant rejection. In one aspect, the 
diagnostic nucleotide set is a leukocyte nucleotide set. Alternatively, EBV nucleotide sequences are 
added to a leukocyte nucleotide set, for use in diagnosing EBV. Disease criteria correspond with 
diagnosis of EBV, and, in patients who are EBV-sero-positive, presence (or prospective occurrence ) of 
EBV-related illnesses such as mononucleosis, and EBV-associated lymphoma. Diagnostic nucleotide 
sets are useful for diagnosis of EBV, and prediction of occurrence of EBV-related illnesses. 
Cytomegalovirus (CMV) 

Cytomegalovirus cause inflammation and disease in almost any tissue, particularly the colon, 
lung, bone marrow and retina, and is a very important cause of disease in immunosuppressed patients, 
e.g. transplant, cancer, AIDS. Many patients are infected with or have been exposed to CMV, but not 
all patients develop clinical disease from the virus. Also, CMV negative recipients of allografts that 
come from CMV positive donors are at high risk for CMV infection. As immunosuppressive drugs are 
developed and used, it is increasingly important to identify patients with current or impending clinical 
CMV disease, because the potential benefit of immunosuppressive therapy must be balanced with the 
increased rate of clinical CMV infection and disease that may result from the use of 
immunosuppression therapy. CMV may also play a role in the occurrence of atherosclerosis or 
restenosis after angioplasty. CMV expression also correlates to transplant rejection, and is useful in 
diagnosing and monitoring transplant rejection. 

Diagnostic nucleotide sets are developed for use in diagnosis and monitoring of CMV 
infection or re-activation of CMV infection. In one aspect, the diagnostic nucleotide set is a leukocyte 
nucleotide set. In another aspect, CMV nucleotide sequences are added to a leukocyte nucleotide set, 
for use in diagnosing CMV. Disease criteria correspond to diagnosis of CMV (e.g., sero-positive state) 
and presence of clinically active CMV. Disease criteria may also correspond to prospective data, e.g. 
the likelihood that CMV will become clinically active or impending clinical CMV infection. Antiviral 
medications are available and diagnostic nucleotide sets can be used to select patients for early 
treatment, chronic suppression or prophylaxis of CMV activity. 
Hepatitis B and C 

These chronic viral infections affect about 1.25 and 2.7 million patients in the US, 
respectively. Many patients are infected, but suffer no clinical manifestations. Some patients with 
infection go on to suffer from chronic liver failure, cirrhosis and hepatic carcinoma. 

Diagnostic nucleotide sets are developed for use in diagnosis and monitoring of HBV or HCV 
infection. In one aspect, the diagnostic nucleotide set is a leukocyte nucleotide set. In another aspect, 
viral nucleotide sequences are added to a leukocyte nucleotide set, for use in diagnosing the virus and 
monitoring progression of liver disease. Disease criteria correspond to diagnosis of the virus (e.g., 
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sero-positive state or other disease symptoms). Alternatively, disease criteria correspond to liver 
damage, e.g., elevated alkaline phosphatase, ALT, AST or evidence of ongoing hepatic damage on 
liver biopsy. Alternatively, disease criteria correspond to serum liver tests (AST, ALT, Alkaline 
Phosphatase, GGT, PT, bilirubin), liver biopsy, liver ultrasound, viral load by serum PCR, cirrhosis, 
hepatic cancer, need for hospitalization or listing for liver transplant. Diagnostic nucleotide sets are 
used to diagnose HBV and HCV, and to predict likelihood of disease progression. Antiviral 
therapeutic usage, such as Interferon gamma and Ribavirin, can also be disease criteria. 
HIV 

HIV infects T cells and certainly causes alterations in leukocyte expression. Diagnostic 
nucleotide sets are developed for diagnosis and monitoring of HIV. In one aspect, the diagnostic 
nucleotide set is a leukocyte nucleotide set. In another aspect, viral nucleotide sequences are added to a 
leukocyte nucleotide set, for use in diagnosing the virus. Disease criteria correspond to diagnosis of the 
virus (e.g., sero-positive state). In addition, disease criteria correspond to viral load, CD4 T cell counts, 
opportunistic infection, response to antiretroviral therapy, progression to AIDS, rate of progression and 
the occurrence of other HIV related outcomes (e.g., malignancy, CNS disturbance). Response to 
antiretrovirals may also be disease criteria. 
Pharmacozenomics 

Pharmocogenomics is the study of the individual propensity to respond to a particular drug 
therapy (combination of therapies). In this context, response can mean whether a particular drug will 
work on a particular patient, e.g. some patients respond to one drug but not to another drug. Response 
can also refer to the likelihood of successful treatment or the assessment of progress in treatment. 
Titration of drug therapy to a particular patient is also included in this description, e.g. different 
patients can respond to different doses of a given medication. This aspect may be important when 
drugs with side-effects or interactions with other drug therapies are contemplated. 

Diagnostic nucleotide sets are developed and validated for use in assessing whether a patient 
will respond to a particular therapy and/or monitoring response of a patient to drug therapy(therapies). 
Disease criteria correspond to presence or absence of clinical symptoms or clinical endpoints, presence 
of side-effects or interaction with other drug(s). The diagnostic nucleotide set may further comprise 
nucleotide sequences that are targets of drug treatment or markers of active disease. 
Validation and accuracy of diagnostic nucleotide sets 

Prior to widespread application of the diagnostic probe sets of the invention the predictive 
value of the probe set is validated. When the diagnostic probe set is discovered by microarray based 
expression analysis, the differential expression of the member genes may be validated by a less variable 
and more quantitive and accurate technology such as real time PCR. In this type of experiment the 
amplification product is measured during the PCR reaction. This enables the researcher to observe the 
amplification before any reagent becomes rate limiting for amplification. In kinetic PCR the 
measurement is of C T (threshold cycle) or C P (crossing point). This measurement (Gr=C P ) is the point 
at which an amplification curve crosses a threshold fluorescence value. The threshold is set to a point 
within the area where all of the reactions were in their linear phase of amplification. When measuring 
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C T , a lower C T value is indicative of a higher amount of starting material since an earlier cycle number 
means the threshold was crossed more quickly. 

Several fluorescence methodologies are available to measure amplification product in real- 
time PGR. Taqman (Applied BioSystems, Foster City, CA) uses fluorescence resonance energy 
transfer (FRET) to inhibit signal from a probe until the probe is degraded by the sequence specific 
binding and Taq 3'exonuclease activity. Molecular Beacons (Stratagene, La Jolla, CA) also use FRET 
technology, whereby the fluorescence is measured when a hairpin structure is relaxed by the specific 
probe binding to the amplified DNA. The third commonly used chemistry is Sybr Green, a DNA- 
binding dye (Molecular Probes, Eugene, OR). The more amplified product that is produced, the higher 
the signal. The Sybr Green method is sensitive to non-specific amplification products, increasing the 
importance of primer design and selection. Other detection chemistries can also been used, such as 
ethedium bromide or other DNA-binding dyes and many modifications of the fluorescent dye/quencher 
dye Taqman chemistry, for example scorpions. 

Real-time PCR validation can be done as described in Example 12. 

Typically, the oligonucleotide sequence of each probe is confirmed, e.g. by DNA sequencing 
using an oligonucleotide-specific primer. Partial sequence obtained is generally sufficient to confirm 
the identity of the oligonucleotide probe. Alternatively, a complementary polynucleotide is 
fluorescently labeled and hybridized to the array, or to a different array containing a resynthesized 
version of the oligo nucleotide probe, and detection of the correct probe is confirmed. 

Typically, validation is performed by statistically evaluating the accuracy of the 
correspondence between the molecular signature for a diagnostic probe set and a selected indicator. 
For example, the expression differential for a nucleotide sequence between two subject classes can be 
expressed as a simple ratio of relative expression. The expression of the nucleotide sequence in 
subjects with selected indicator can be compared to the expression of that nucleotide sequence in 
subjects without the indicator, as described in the following equations. 

ZE x ai/N = E X A the average expression of nucleotide sequence x in the members of group A; 
EE x bi/M = E X B the average expression of nucleotide sequence x in the members of group B; 
E X AJ ExB =AE X AB the average differential expression of nucleotide sequence x between groups 

A and B: 

where S indicates a sum; Ex is the expression of nucleotide sequence x relative to a standard; ai are the 
individual members of group A, group A has N members; bi are the individual members of group B, 
group B has M members. 

The expression of at least two nucleotide sequences, e.g., nucleotide sequence X and 
nucleotide sequence Y are measured relative to a standard in at least one subject of group A (e.g., with 
a disease) and group B (e.g., without the disease). Ideally, for purposes of validation the indicator is 
independent from (i.e., not assigned based upon) the expression pattern; Alternatively, a minimum 
threshold of gene expression for nucleotide sequences X and Y, relative to the standard, are designated 
for assignment to group A. For nucleotide sequence x, this threshold is designated AEx, and for 
nucleotide sequence y, the threshold is designated AEy. 

The following formulas are used in the calculations below: 
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Sensitivity = (true positives/true positives + false negatives) 
Specificity = (true negatives/true negatives + false positives) 
]f, for example, expression of nucleotide sequence x above a threshold: x > AEx, is observed 
for 80/100 subjects in group A and for 10/100 subjects in group B, the sensitivity of nucleotide 
sequence x for the assignment to group A, at the given expression threshold AEx, is 80%, and the 
specificity is 90%. 

If the expression of nucleotide sequence y is > AEy in 80/100 subjects in group A, and in 
10/100 subjects in group B, then, similarly the sensitivity of nucleotide sequence y for the assignment 
to group A at the given threshold AEy is 80% and the specificity is 90%. If in addition, 60 of the 80 
subjects in group A that meet the expression threshold for nucleotide sequence y also meet the 
expression threshold AEx and that 5 of the 10 subjects in group B that meet the expression threshold 
for nucleotide sequence y also meet the expression threshold AEx, the sensitivity of the test (x>AEx 
and y>AEy)for assignment of subjects to group A is 60% and the specificity is 95%. 

Alternatively, if the criteria for assignment to group A are change to: Expression of x > AEx 
or expression of y > AEy, the sensitivity approaches 100% and the specificity is 85%. 

Clearly, the predictive accuracy of any diagnostic probe set is dependent on the minimum 
expression threshold selected. The expression of nucleotide sequence X (relative to a standard) is 
measured in subjects of groups A (with disease) and B (without disease). The minimum threshold of 
nucleotide sequence expression for x, required for assignment to group A is designated AEx I. 

If 90/100 patients in group A have expression of nucleotide sequence x > AEx 1 and 20/100 
patients in group B have expression of nucleotide sequence x > AEx 1, then the sensitivity of the 
expression of nucleotide sequence x (using AEx 1 as a minimum expression threshold) for assignment 
of patients to group A will be 90% and the specificity will be 80%. 

Altering the minimum expression threshold results in an alteration in the specificity and 
sensitivity of the nucleotide sequences in question. For example, if the minimum expression threshold 
of nucleotide sequence x for assignment of subjects to group A is lowered to AEx 2, such that 100/100 
subjects in group A and 40/100 subjects in group B meet the threshold, then the sensitivity of the test 
for assignment of subjects to group A will be 100% and the specificity will be 60%. 

Thus, for 2 nucleotide sequences X and Y: the expression of nucleotide sequence x and 
nucleotide sequence y (relative to a standard) are measured in subjects belonging to groups A (with 
disease) and B (without disease). Minimum thresholds of nucleotide sequence expression for 
nucleotide sequences X and Y (relative to common standards) are designated for assignment to group 
A. For nucleotide sequence x, this threshold is designated AExl and for nucleotide sequence y, this 
threshold is designated AEyl. 

If in group A, 90/100 patients meet the minimum requirements of expression AExl and AEyl , 
and in gr° u P B > 10/100 subjects meet the minimum requirements of expression AExl and AEyl, then 
the sensitivity of the test for assignment of subjects to group A is 90% and the specificity is 90%. 

Increasing the minimum expression thresholds for X and Y to AEx2 and AEy2, such that in 
group A, 70/100 subjects meet the minimum requirements of expression AEx2 and AEy2, and in group 
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B, 3/100 subjects meet the minimum requirements of expression AEx2 and AEy2. Now the sensitivity 
of the test for assignment of subjects to group A is 70% and the specificity is 97%. 

If the criteria for assignment to group A is that the subject in question meets either threshold, 
AEx2 or AEy2, and it is found that 100/100 subjects in group A meet the criteria and 20/100 subjects in 
group B meet the criteria, then the sensitivity of the test for assignment to group A is 100% and the 
specificity is 80%. 

Individual components of a diagnostic probe set each have a defined sensitivity and specificity 
for distinguishing between subject groups. Such individual nucleotide sequences can be employed in 
concert as a diagnostic probe set to increase the sensitivity and specificity of the evaluation. The 
database of molecular signatures is queried by algorithms to identify the set of nucleotide sequences 
(i.e., corresponding to members of the probe set) with the highest average differential expression 
between subject groups. Typically, as the number of nucleotide sequences in the diagnostic probe set 
increases, so does the predictive value, that is, the sensitivity and specificity of the probe set. When the 
probe sets are defined they may be used for diagnosis and patient monitoring as discussed below. The 
diagnostic sensitivity and specificity of the probe sets for the defined use can be determined for a given 
probe set with specified expression levels as demonstrated above. By altering the expression threshold 
required for the use of each nucleotide sequence as a diagnostic, the sensitivity and specificity of the 
probe set can be altered by the practitioner. For example, by lowering the magnitude of the expression 
differential threshold for each nucleotide sequence in the set, the sensitivity of the test will increase, but 
the specificity will decrease. As is apparent from the foregoing discussion, sensitivity and specificity 
are inversely related and the predictive accuracy of the probe set is continuous and dependent on the 
expression threshold set for each nucleotide sequence. Although sensitivity and specificity tend to 
have an inverse relationship when expression thresholds are altered, both parameters can be increased 
as nucleotide sequences with predictive value are added to the diagnostic nucleotide set. In addition a 
single or a few markers may not be reliable expression markers across a population of patients. This is 
because of the variability in expression and measurement of expression that exists between 
measurements, individuals and individuals over time. Inclusion of a large number of candidate 
nucleotide sequences or large numbers of nucleotide sequences in a diagnostic nucleotide set allows for 
this variability as not all nucleotide sequences need to meet a threshold for diagnosis. Generally, more 
markers are better than a single marker. If many markers are used to make a diagnosis, the likelihood 
that all expression markers will not meet some thresholds based upon random variability is low and 
thus the test will give fewer false negatives. 

It is appreciated that the desired diagnostic sensitivity and specificity of the diagnostic 
nucleotide set may vary depending on the intended use of the set. For example, in certain uses, high 
specificity and high sensitivity are desired. For example, a diagnostic nucleotide set for predicting 
which patient population may experience side effects may require high sensitivity so as to avoid 
treating such patients. In other settings, high sensitivity is desired, while reduced specificity may be 
tolerated. For example, in the case of a beneficial treatment with few side effects, it may be important 
to identify as many patients as possible (high sensitivity) who will respond to the drug, and treatment 
of some patients who will not respond is tolerated. In other settings, high specificity is desired and 
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reduced sensitivity may be tolerated. For example, when identifying patients for an early-phase clinical 
trial, it is important to identify patients who may respond to the particular treatment. Lower sensitivity 
is tolerated in this setting as it merely results in reduced patients who enroll in the study or requires that 
more patients are screened for enrollment. 

Methods of using diagnostic nucleotide sets. 
The invention also provide methods of using the diagnostic nucleotide sets to: diagnose 
disease; assess severity of disease; predict future occurrence of disease; predict future complications of 
disease; determine disease prognosis; evaluate the patient's risk, or "stratify" a group of patients; assess 

r 

response to current drug therapy; assess response to current non-pharmacological therapy; determine 
the most appropriate medication or treatment for the patient; predict whether a patient is likely to 
respond to a particular drug; and determine most appropriate additional diagnostic testing for the 
patient, among other clinically and epidemiologically relevant applications. 

The nucleotide sets of the invention can be utilized for a variety of purposes by physicians, 
healthcare workers, hospitals, laboratories, patients, companies and other institutions. As indicated 
previously, essentially any disease, condition, or status for which at least one nucleotide sequence is 
differentially expressed in leukocyte populations (or sub-populations) can be evaluated, e.g., diagnosed, 
monitored, etc. using the diagnostic nucleotide sets and methods of the invention. In addition to 
assessing health status at an individual level, the diagnostic nucleotide sets of the present invention are 
suitable for evaluating subjects at a "population level," e.g., for epidemiological studies, or for 
population screening for a condition or disease. 

Collection and preparation of sample 

RNA, protein and/or DNA is prepared using methods well-known in the art, as further 
described herein. It is appreciated that subject samples collected for use in the methods of the 
invention are generally collected in a clinical setting, where delays may be introduced before RNA 
samples are prepared from the subject samples of whole blood, e.g. the blood sample may not be 
promptly delivered to the clinical lab for further processing. Further delay may be introduced in the 
clinical lab setting where multiple samples are generally being processed at any given time. For this 
reason, methods which feature lengthy incubations of intact leukocytes at room temperature are not 
preferred, because the expression profile of the leukocytes may change during this extended time 
period. For example, RNA can be isolated from whole blood using a phenol/guanidine isothiocyanate 
reagent or another direct whole-blood lysis method, as described in, e.g., U.S. Patent Nos. 5,346,994 
and 4,843,155. This method may be less preferred under certain circumstances because the large 
majority of the RNA recovered from whole blood RNA extraction comes from erythrocytes since these 
cells outnumber leukocytes 1000: 1 . Care must be taken to ensure that the presence of erythrocyte RNA 
and protein does not introduce bias in the RNA expression profile data or lead to inadequate sensitivity 
or specificity of probes. 

Alternatively, intact leukocytes may be collected from whole blood using a lysis buffer that 
selectively lyses erythrocytes, but not leukocytes, as described, e.g., in (U.S. Patent Nos. 5,973,137, 
and 6,020,186). Intact leukocytes are then collected by centrifugation, and leukocyte RNA is isolated 
using standard protocols, as described herein. However, this method does not allow isolation of sub- 
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populations of leukocytes, e.g. mononuclear cells, which may be desired. In addition, the expression 
profile may change during the lengthy incubation in lysis buffer, especially in a busy clinical lab where 
large numbers of samples are being prepared at any given time. 

Alternatively, specific leukocyte cell types can be separated using density gradient reagents 
(Boyum, A, 1968.)- For example, mononuclear cells may be separated from whole blood using density 
gradient centrifugation, as described, e.g., in U.S. Patents Nos. 4190535, 4350593, 4751001, 4818418, 
and 5053134. Blood is drawn directly into a tube containing an anticoagulant and a density reagent 
(such as Ficoll or Percoll). Centrifugation of this tube results in separation of blood into an erythrocyte 
and granulocyte layer, a mononuclear cell suspension, and a plasma layer. The mononuclear cell layer 
is easily removed and the cells can be collected by centrifugation, lysed, and frozen. Frozen samples 
are stable until RNA can be isolated. Density centrifugation, however, must be conducted at room 
temperature, and if processing is unduly lengthy, such as in a busy clinical lab, the expression profile 
may change. 

Alternatively, cells can be separated using fluorescence activated cell sorting (FACS) or some 
other technique, which divides cells into subsets based on gene or protein expression. This may be 
desirable to enrich the sample for cells of interest, but it may also introduce cell manipulations and time 
delays, which result in alteration of gene expression profiles (Cantor et al. 1975; Galbraith et al. 1999). 

The quality and quantity of each clinical RNA sample is desirably checked before 
amplification and labeling for array hybridization, using methods known in the art. For example, one 
microliter of each sample may be analyzed on a Bioanalyzer (Agilent 2100 Palo Alto, CA. USA) using 
an RNA 6000 nano LabChip (Caliper, Mountain View, CA. USA). Degraded RNA is identified by the 
reduction of the 28S to 18S ribosomal RNA ratio and/or the presence of large quantities of RNA in the 
25-100 nucleotide range. 

It is appreciated that the RNA sample for use with a diagnostic nucleotide set may be 
produced from the same or a different cell population, sub-population and/or cell type as used to 
identify the diagnostic nucleotide set. For example, a diagnostic nucleotide set identified using RNA 
extracted from mononuclear cells may be suitable for analysis of RNA extracted from whole blood or 
mononuclear cells, depending on the particular characteristics of the members of the diagnostic 
nucleotide set. Generally, diagnostic nucleotide sets must be tested and validated when used with RNA ■ 
derived from a different cell population, sub-population or cell type than that used when obtaining the 
diagnostic gene set. Factors such as the cell-specific gene expression of diagnostic nucleotide set 
members, redundancy of the information provided by members of the diagnostic nucleotide set, 
expression level of the member of the diagnostic nucleotide set, and cell-specific alteration of 
expression of a member of the diagnostic nucleotide set will contribute to the usefullness of using a 
different RNA source than that used when identifying the members of the diagnostic nucleotide set. It 
is appreciated that it may be desirable to assay RNA derived from whole blood, obviating the need to 
isolate particular cell types from the blood. 
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Rapid method ofRNA extraction suito hlr for pro duction in a clinical setting of hieh quality 

RNA for expression profiling 
In a clinical setting, obtaining high quality RNA preparations suitable for expression profiling, 
from a desired population of leukocytes poses certain technical challenges, including: the lack of 
capacity for rapid, high-throughput sample processing in the clinical setting, and the possibility that 
delay in processing (in a busy lab or in the clinical setting) may adversely affect RNA quality, e.g. by a 
permitting the expression profile of certain nucleotide sequences to shift. Also, use of toxic and 
expensive reagents, such as phenol, may be disfavored in the clinical setting due to the added expense 
associated with shipping and handling such reagents. 

A useful method for RNA isolation for leukocyte expression profiling would allow the 
isolation of monocyte and lymphocyte RNA in a timely manner, while preserving the expression 
profiles of the cells, and allowing inexpensive production of reproducible high-quality RNA samples. 
Accordingly, the invention provides a method of adding inhibitor(s) of RNA transcription and/or 
inhibitors) of protein synthesis, such that the expression profile is "frozen" and RNA degradation is 
reduced. A desired leukocyte population or sub-population is then isolated, and the sample may be 
frozen or lysed before further processing to extract the RNA. Blood is drawn from subject population 
and exposed to ActinomycinD (to a final concentration of 10 ug/ml) to inhibit transcription, and 
cycloheximide (to a final concentration of 10 ug/ml) to inhibit protein synthesis. The inhibitor(s) can 
be injected into the blood collection tube in liquid form as soon as the blood is drawn, or the tube can 
be manufactured to contain either lyophilized inhibitors or inhibitors that are in solution with the 
anticoagulant. At this point, the blood sample can be stored at room temperature until the desired 
leukocyte population or sub-population is isolated, as described elsewhere. RNA is isolated using 
standard methods, e.g., as described above, or a cell pellet or extract can be frozen until further 
processing of RNA is convenient. 

The invention also provides a method of using a low-temperature density gradient for 
separation of a desired leukocyte sample. In another embodiment, the invention provides the 
combination of use of a low-temperature density gradient and the use of transcriptional and/or protein 
synthesis inhibitor(s). A desired leukocyte population is separated using a density gradient solution for 
cell separation that maintains the required density and viscosity for cell separation at 0-4°C. Blood is 
drawn into a tube containing this solution and may be refrigerated before and during processing as the 
low temperatures slow cellular processes and minimize expression profile changes. Leukocytes are 
separated, and RNA is isolated using standard methods. Alternately, a cell pellet or extract is frozen 
until further processing of RNA is convenient. Care must be taken to avoid rewarming the sample 
during further processing steps. 

Alternatively, the invention provides a method of using low-temperature density gradient 
separation, combined with the use of actinomycin A and cyclohexamide, as described above. 
Assessing expression for diagnostics 
Expression profiles for the set of diagnostic nucleotide sequences in a subject sample can be 
evaluated by any technique that deteirmines the expression of each component nucleotide sequence. 
Methods suitable for expression analysis are known in the art, and numerous examples are discussed in 
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the Sections titled "Methods of obtaining expression data" and "high throughput expression Assays", 
above. 

In many cases, evaluation of expression profiles is most efficiently, and cost effectively, 
performed by analyzing RNA expression. Alternatively, the proteins encoded by each component of 
the diagnostic nucleotide set are detected for diagnostic purposes by any technique capable of 
determining protein expression, e.g., as described above. Expression profiles can be assessed in subject 
leukocyte sample using the same or different techniques as those used to identify and validate the 
diagnostic nucleotide set. For example, a diagnostic nucleotide set identified as a subset of sequences 
on a cDNA microarray can be utilized for diagnostic (or prognostic, or monitoring, etc.) purposes on 
the same array from which they were identified. Alternatively, the diagnostic nucleotide sets for a 
given disease or condition can be organized onto a dedicated sub-array for the indicated purpose. It is 
important to note that if diagnostic nucleotide sets are discovered using one technology, e.g. RNA 
expression profiling, but applied as a diagnostic using another technology, e.g. protein expression 
profiling, the nucleotide sets must generally be validated for diagnostic purposes with the new 
technology. In addition, it is appreciated that diagnostic nucleotide sets that are developed for one use, 
e.g. to diagnose a particular disease, may later be found to be useful for a different application, e.g. to 
predict the likelihood that the particular disease will occur. Generally, the diagnostic nucleotide set 
will need to be validated for use in the second circumstance. As discussed herein, the sequence of 
diagnostic nucleotide set members may be amplified from RNA or cDNA using methods known in the 
art providing specific amplification of the nucleotide sequences. 
General Protein Methods 

Protein products of the nucleotide sequences of the invention may include proteins that 
represent functionally equivalent gene products. Such an equivalent gene product may contain 
deletions, additions or substitutions of amino acid residues within the amino acid sequence encoded by 
the nucleotide sequences described, above, but which result in a silent change, thus producing a 
functionally equivalent nucleotide sequence product. Amino acid substitutions may be made on the 
basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues involved. 

For example, nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, 
proline, phenylalanine, tryptophan, and methionine; polar neutral amino acids include glycine, serine, 
threonine, cysteine, tyrosine, asparagine, and glutamine; positively charged (basic) amino acids include 
arginine, lysine, and histidine; and negatively charged (acidic) amino acids include aspartic acid and 
glutamic acid. "Functionally equivalent", as utilized herein, refers to a protein capable of exhibiting a 
substantially similar in vivo activity as the endogenous gene products encoded by the nucleotide 
described, above. 

The gene products (protein products of the nucleotide sequences) may be produced by 
recombinant DNA technology using techniques well known in the art. Thus, methods for preparing the 
gene polypeptides and peptides of the invention by expressing nucleic acid encoding nucleotide 
sequences are described herein. Methods which are well known to those skilled in the art can be used 
to construct expression vectors containing nucleotide sequence protein coding sequences and 
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appropriate transcriptional/translational control signals. These methods include, for example, in vitro 
recombinant DNA techniques, synthetic techniques and in vivo recombination/genetic recombination. 
See, for example, the techniques described in Sambrook et al., 1989, supra, and Ausubel et al., 1989, 
supra. Alternatively, RNA capable of encoding nucleotide sequence protein sequences may be 
chemically synthesized using, for example, synthesizers. See, for example, the techniques described in 
"Oligonucleotide Synthesis", 1984, Gait, M. J. ed., IRL Press, Oxford, which is incorporated by 
reference herein in its entirety 

A variety of host-expression vector systems may be utilized to express the nucleotide 
sequence coding sequences of the invention. Such host-expression systems represent vehicles by 
which the coding sequences of interest may be produced and subsequently purified, but also represent 
cells which may, when transformed or transfected with the appropriate nucleotide coding sequences, 
exhibit the protein encoded by the nucleotide sequence of the invention in situ. These include but are 
not limited to microorganisms such as bacteria (e.g., E. coli, B. subtilis) transformed with recombinant 
bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors containing nucleotide sequence 
protein coding sequences; yeast (e.g. Saccharomyces, Pichia) transformed with recombinant yeast 
expression vectors containing the nucleotide sequence protein coding sequences; insect cell systems 
infected with recombinant virus expression vectors (e.g., baculovirus) containing the nucleotide 
sequence protein coding sequences; plant cell systems infected with recombinant virus expression 
vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with 
recombinant plasmid expression vectors (e.g., Ti plasmid) containing nucleotide sequence protein 
coding sequences; or mammalian cell systems (e.g. COS, CHO, BHK, 293, 3T3) harboring 
recombinant expression constructs containing promoters derived from the genome of mammalian cells 
(e.g., metallothionein promoter) or from mammalian viruses (e.g., the adenovirus late promoter; the 
vaccinia virus 7.5 K promoter). 

In bacterial systems, a number of expression vectors may be advantageously selected 
depending upon the use intended for the nucleotide sequence protein being expressed. For example, 
when a large quantity of such a protein is to be produced, for the generation of antibodies or to screen 
peptide libraries, for example, vectors which direct the expression of high levels of fusion protein 
products that are readily purified may be desirable. Such vectors include, but are not limited, to the E. 
coli expression vector pUR278 (Ruther et al., 1983, EMBO J. 2:1791), in which the nucleotide 
sequence protein coding sequence may be ligated individually into the vector in frame with the lac Z 
coding region so that a fusion protein is produced; pIN vectors (Inouye & Inouye, 1985, Nucleic Acids 
Res. 13:3101-3109; Van Heeke & Schuster, 1989, J. Biol. Chem. 264:5503-5509); and the likes of 
pGEX vectors may also be used to express foreign polypeptides as fusion proteins with glutathione S- 
transferase (GST). In general, such fusion proteins are soluble and can easily be purified from lysed 
cells by adsorption to glutathione-agarose beads followed by elution in the presence of free glutathione. 
The pGEX vectors are designed to include thrombin or factor Xa protease cleavage sites so that the 
cloned target nucleotide sequence protein can be released from the GST moiety. Other systems useful 
in the invention include use of the FLAG epitope or the 6-HIS systems. 
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In an insect system, Autographa califomica nuclear polyhedrosis virus ( AcNPV) is used as a 
vector to express foreign nucleotide sequences. The virus grows in Spodoptera frugiperda cells. The 
nucleotide sequence coding sequence may be cloned individually into non-essential regions (for 
example the polyhedrin gene) of the virus and placed under control of an AcNPV promoter (for 
example the polyhedrin promoter). Successful insertion of nucleotide sequence coding sequence will 
result in inactivation of the polyhedrin gene and production of non-occluded recombinant virus (i.e., 
virus lacking the proteinaceous coat coded for by the polyhedrin gene). These recombinant viruses are 
then used to infect Spodoptera frugiperda cells in which the inserted nucleotide sequence is expressed. 
(E.g., see Smith et ah, 1983, J. Virol. 46: 584; Smith, U.S. Pat. No. 4,215,051). 

In rrmrnmalian host cells, a number of viral-based expression systems may be utilized. In 
cases where an adenovirus is used as an expression vector, the nucleotide sequence coding sequence of 
interest may be ligated to an adenovirus transcription/translation control complex, e.g., the late 
promoter and tripartite leader sequence. This chimeric nucleotide sequence may then be inserted in the 
adenovirus genome by in vitro or in vivo recombination. Insertion in a non-essential region of the viral 
genome (e.g., region El or E3) will result in a recombinant virus that is viable and capable of 
expressing nucleotide sequence encoded protein in infected hosts. (E.g., See Logan & Shenk, 1984, 
Proc. Natl. Acad. Sci. USA 8 1 :3655-3659). Specific initiation signals may also be required for 
efficient translation of inserted nucleotide sequence coding sequences. These signals include the ATG 
initiation codon and adjacent sequences. In cases where an entire nucleotide sequence, including its 
own initiation codon and adjacent sequences, is inserted into the appropriate expression vector, no 
additional translational control signals may be needed. However, in cases where only a portion of the 
nucleotide sequence coding sequence is inserted, exogenous translational control signals, including, 
perhaps, the ATG initiation codon, must be provided. Furthermore, the initiation codon must be in 
phase with the reading frame of the desired coding sequence to ensure translation of the entire insert. 
These exogenous translational control signals and initiation codons can be of a variety of origins, both 
natural and synthetic. The efficiency of expression may be enhanced by the inclusion of appropriate 
transcription enhancer elements, transcription terminators, etc. (see Bittner et al., 1987, Methods in 
Enzymol. 153:516-544). 

In addition, a host cell strain may be chosen which modulates the expression of the inserted 
sequences, or modifies and processes the product of the nucleotide sequence in the specific fashion 
desired. Such modifications (e.g., glycosylation) and processing (e.g., cleavage) of protein products 
may be important for the function of the protein. Different host cells have characteristic and specific 
mechanisms for the post- translational processing and modification of proteins. Appropriate cell lines 
or host systems can be chosen to ensure the correct modification and processing of the foreign protein 
expressed. To this end, eukaryotic host cells which possess the cellular machinery for proper 
processing of the primary transcript, glycosylation, and phosphorylation of the gene product may be 
used. Such mammalian host cells include but are not limited to CHO, VERO, BHK, HeLa, COS, 
MDCK, 293, 3T3, WI38, etc. 

For long-term, high-yield production of recombinant proteins, stable expression is preferred. 
For example, cell lines which stably express the nucleotide sequence encoded protein may be 
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engineered. Rather than using expression vectors which contain viral origins of replication, host cells 
can be transformed with DNA controlled by appropriate expression control elements (e.g., promoter, 
enhancer, sequences, transcription terminators, polyadenylation sites, etc.), and a selectable marker. 
Following the introduction of the foreign DNA, engineered cells may be allowed to grow for 1-2 days 
in an enriched media, and then are switched to a selective media. The selectable marker in the 
recombinant plasmid confers resistance to the selection and allows cells to stably integrate the plasmid 
into their chromosomes and grow to form foci which in turn can be cloned and expanded into cell lines. 
This method may advantageously be used to engineer cell lines which express nucleotide sequence 
encoded protein. Such engineered cell lines may be particularly useful in screening and evaluation of 
compounds that affect the endogenous activity of the nucleotide sequence encoded protein. 

A number of selection systems may be used, including but not limited to the herpes simplex 
virus thymidine kinase (Wigler, et al., 1977, Cell 1 1:223), hypoxanthine-guanine 
phosphoribosyltransferase (Szybalska & Szybalski, 1962, Proc. Natl. Acad. Sci. USA 48:2026), and 
adenine phosphoribosyltransferase (Lowy, et al., 1980, Cell 22:817) genes can be employed in tk-, 
hgprt- or aprt- cells, respectively. Also, antimetabolite resistance can be used as the basis of selection 
for dhfr, which confers resistance to methotrexate (Wigler, et al., 1980, Natl. Acad. Sci. USA 77:3567; 
O'Hare, et al., 1981, Proc. Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance to 
mycophenolic acid (Mulligan & Berg, 1981, Proc. Natl. Acad. Sci. USA 78:2072); neo, which confers 
resistance to the aminoglycoside G-418 (Colberre-Garapin, et al., 1981, J. Mol. Biol. 150:1); and 
hygro, which confers resistance to hygromycin (Santerre, et al., 1984, Gene 30:147) genes. 

An alternative fusion protein system allows for the ready purification of non-denatured fusion 
proteins expressed in human cell lines (Janknecht, et al., 1991, Proc. Natl. Acad. Sci. USA 88: 8972- 
8976). In this system, the nucleotide sequence of interest is subcloned into a vaccinia recombination 
plasmid such that the nucleotide sequence's open reading frame is translationally fused to an amino- 
terminal tag consisting of six histidine residues. Extracts from cells infected with recombinant vaccinia 
virus are loaded onto Ni.sup.2 +-nitriloacetic acid-agarose columns and histidine-tagged proteins are 
selectively eluted with imidazole-containing buffers. 

Where recombinant DNA technology is used to produce the protein encoded by the nucleotide 
sequence for such assay systems, it may be advantageous to engineer fusion proteins that can facilitate 
labeling, immobilization and/or detection. 
Antibodies 

Indirect labeling involves the use of a protein, such as a labeled antibody, which specifically 
binds to the protein encoded by the nucleotide sequence. Such antibodies include but are not limited to 
polyclonal, monoclonal, chimeric, single chain, Fab fragments and fragments produced by an Fab 
expression library. 

The invention also provides for antibodies to the protein encoded by the nucleotide sequences. 
Described herein are methods for "the production of antibodies capable of specifically recognizing one 
or more nucleotide sequence epitopes. Such antibodies may include, but are not limited to polyclonal 
antibodies, monoclonal antibodies (mAbs), humanized or chimeric antibodies, single chain antibodies, 
Fab fragments, F(ab')2 fragments/fragments produced by a Fab expression library, anti-idiotypic (anti- 
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Id) antibodies, and epitope-binding fragments of any of the above. Such antibodies may be used, for 
example, in the detection of a nucleotide sequence in a biological sample, or, alternatively, as a method 
for the inhibition of abnormal gene activity, for example, the inhibition of a disease target nucleotide 
sequence, as further described below. Thus, such antibodies may be utilized as part of cardiovascular 
or other disease treatment method, and/or may be used as part of diagnostic techniques whereby 
patients may be tested for abnormal levels of nucleotide sequence encoded proteins, or for the presence 
of abnormal forms of the such proteins. 

For the production of antibodies to a nucleotide sequence, various host animals may be 
immunized by injection with a protein encoded by the nucleotide sequence, or a portion thereof. Such 
host animals may include but are not limited to rabbits, mice, and rats, to name but a few. Various 
adjuvants may be used to increase the immunological response, depending on the host species, 
including but not limited to Freund's (complete and incomplete), mineral gels such as aluminum 
hydroxide, surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, keyhole limpet hemocyanin, dinitrophenol, and potentially useful human adjuvants such as 
BCG (bacille Calmette-Guerin) and Corynebacterium parvum. 

Polyclonal antibodies are heterogeneous populations of antibody molecules derived from the 
sera of animals immunized with an antigen, such as gene product, or an antigenic functional derivative 
thereof. For the production of polyclonal antibodies, host animals such as those described above, may 
be immunized by injection with gene product supplemented with adjuvants as also described above. 

Monoclonal antibodies, which are homogeneous populations of antibodies to a particular 
antigen, may be obtained by any technique which provides for the production of antibody molecules by 
continuous cell lines in culture. These include, but are not limited to the hybridoma technique of Kohler 
and Milstein, (1975, Nature 256:495-497; and U.S. Pat. No. 4,376,1 10), the human B-cell hybridoma 
technique (Kosbor et al., 1983, Immunology Today 4:72; Cole et al., 1983, Proc. Natl. Acad. Sci. USA 
80:2026-2030), and the EBV-hybridoma technique (Cole et al., 1985, Monoclonal Antibodies And 
Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Such antibodies may be of any immunoglobulin class 
including IgG, IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma producing the mAb of this 
invention may be cultivated in vitro or in vivo. 

In addition, techniques developed for the production of "chimeric antibodies" (Morrison et al., 

1984, Proc. Natl. Acad. Sci., 81:6851-6855; Neuberger et al., 1984, Nature, 312:604-608; Takeda et al., 

1985, Nature, 314:452-454) by splicing the genes from a mouse antibody molecule of appropriate 
antigen specificity together with genes from a human antibody molecule of appropriate biological 
activity can be used. A chimeric antibody is a molecule in which different portions are derived from 
different animal species, such as those having a variable region derived from a murine mAb and a 
human immunoglobulin constant region. 

Alternatively, techniques described for the production of single chain antibodies (U.S. Pat. 
No. 4,946,778; Bird, 1988, Science 242:423-426; Huston et al., 1988, Proc. Natl. Acad. Sci. USA 
85:5879-5883; and Ward et al., 1989, Nature 334:544-546) can be adapted to produce nucleotide 
sequence-single chain antibodies. Single chain antibodies are formed by linking the heavy and light 
chain fragments of the Fv region via an amino acid bridge, resulting in a single chain polypeptide. 
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Antibody fragments which recognize specific epitopes may be generated by known techniques 
For example, such fragments include but are not limited to: the F(ab')2 fragments which can be 
produced by pepsin digestion of the antibody molecule and the Fab fragments which can be generated 
by reducing the disulfide bridges of the F(ab')2 fragments/Alternatively, Fab expression libraries may 
be constructed (Huse et ah, 1989, Science, 246:1275-1281) to allow rapid and easy identification of 
monoclonal Fab fragments with the desired specificity. 

Disease specific target nucleotide sequences 
The invention also provides disease specific target nucleotide sequences, and sets of disease 
specific target nucleotide sequences. The diagnostic nucleotide sets, subsets thereof, novel nucleotide 
sequences, and individual members of the diagnostic nucleotide sets identified as described above are 
also disease specific target nucleotide sequences. In particular, individual nucleotide sequences that are 
differentially regulated or have predictive value that is strongly correlated with a disease or disease 
criterion are especially favorable as disease specific target nucleotide sequences. Sets of genes that are 
co-regulated may also be identified as disease specific target nucleotide sets. Such nucleotide 
sequences and/or nucleotide sequence products are targets for modulation by a variety of agents and 
techniques. For example, disease specific target nucleotide sequences (or the products of such 
nucleotide sequences, or sets of disease specific target nucleotide sequences) can be inhibited or 
activated by, e.g., target specific monoclonal antibodies or small molecule inhibitors, or delivery of the 
nucleotide sequence or gene product of the nucleotide sequence to patients. Also, sets of genes can be 
inhibited or activated by a variety of agents and techniques. The specific usefulness of the target 
nucleotide sequence(s) depends on the subject groups from which they were discovered, and the 
disease or disease criterion with which they correlate. 

Imaging 

The invention also provides for imaging reagents. The differentially expressed leukocyte 
nucleotide sequences, diagnostic nucleotide sets, or portions thereof, and novel nucleotide sequences of 
the invention are nucleotide sequences expressed in cells with or without disease. Leukocytes 
expressing a nucleotide sequence(s) that is differentially expressed in a disease condition may localize 
within the body to sites that are of interest for imaging purposes. For example, a leukocyte expressing 
a nucleotide sequence(s) that are differentially expressed in an individual having atherosclerosis may 
localize or accumulate at the site of an atherosclerotic placque. Such leukocytes, when labeled, may 
provide a detection reagent for use in imaging regions of the body where labeled leukocyte accumulate 
or localize, for example, at the atherosclerotic plaque in the case of atherosclerosis. For example, 
leukocytes are collected from a subject, labeled in vitro, and reintroduced into a subject. Alternatively, 
the labeled reagent is introduced into the subject individual, and leukocyte labeling occurs within the 
patient. 

Imaging agents that detect the imaging targets of the invention are produced by well-known 
molecular and immunological methods (for exemplary protocols, see, e.g., Ausubel, Berger, and 
Sambrook, as well as Harlow and Lane, supra). 

For example, a full-length nucleic acid sequence, or alternatively, a gene fragment encoding 
an immunogenic peptide or polypeptide fragments, is cloned into a convenient expression vector, for 
» • * » 
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example, a vector including an in-frame epitope or substrate binding tag to facilitate subsequent 
purification. Protein is then expressed from the cloned cDNA sequence and used to generate 
antibodies, or other specific binding molecules, to one or more antigens of the imaging target protein. 
Alternatively, a natural or synthetic polypeptide (or peptide) or small molecule that specifically binds ( 
or is specifically bound to) the expressed imaging target can be identified through well established 
techniques {see, e.g., Mendel et al. (2000) Anticancer Drug Des 1 5:29-4 1 ; Wilson (2000) Curr Med 
Chem 7:73-98; Hamby and Showwalter ( 1 999) Pharmacol Ther 82:169-93; and Shimazawa et al. 
(1998) Curr Opin Struct Biol 8:451-8). The binding molecule, e.g., antibody, small molecule ligand, 
etc., is labeled with a contrast agent or other detectable label, e.g., gadolinium, iodine, or a gamma- 
emitting source. For in-vivo imaging of a disease process that involved leukocytes, the labeled 
antibody is infused into a subject, e.g., a human patient or animal subject, and a sufficient period of 
time is passed to permit binding of the antibody to target cells. The subject is then imaged with 
appropriate technology such as MRI (when the label is gadolinium) or with a gamma counter (when the 
label is a gamma emitter). 

Identification of nucleotide sequence involved in l eukocyte adhesion 
The invention also encompasses a method of identifying nucleotide sequences involved in 
leukocyte adhesion. The interaction between the endothelial cell and leukocyte is a fundamental 
mechanism of all iiiflammatory disorders, including the diagnosis and prognosis of allograft rejection 
the diseases listed in Table 1. For example, the first visible abnormality in atherosclerosis is the 
adhesion to the endothelium and diapedesis of mononuclear cells (e.g., T-cell and monocyte). Insults 
to the endothelium (for example, cytokines, tobacco, diabetes, hypertension and many more) lead to 
endothelial cell activation. The endothelium then expresses adhesion molecules, which have counter 
receptors on mononuclear cells. Once the leukocyte receptors have bound the endothelial adhesion 
molecules, they stick to the endothelium, roll a short distance/stop and transmigrate across the 
endothelium. A similar set of events occurs in both acute and chronic inflammation. When the 
leukocyte binds the endothelial adhesion molecule, or to soluble cytokines secreted by endothelial or 
other cells, a program of gene expression is activated in the leukocyte. This program of expression 
leads to leukocyte rolling, firm adhesion and transmigration into the vessel wall or tissue parenchyma. 
Inhibition of this process is highly desirable goal in anti-inflammatory drug development. In addition, 
leukocyte nucleotide sequences and epithelial cell nucleotide sequences, that are differentially 
expressed during this process may be disease-specific target nucleotide sequences. 

Human endothelial cells, e.g. derived from human coronary arteries, human aorta, human 
pulmonary artery, human umbilical vein or microvascular endothelial cells, are cultured as a confluent 
monolayer, using standard methods. Some of the endothelial cells are then exposed to cytokines or 
another activating stimuli such as oxidized LDL, hyperglycemia, shear stress, or hypoxia (Moser et al. 
1992). Some endothelial cells are not exposed to such stimuli and serve as controls. For example, the 
endothelial cell monolayer is incubated with culture medium containing 5 U/ml of human recombinant 
IL-lalpha or 10 ng/ml TNF (tumor necrosis factor), for a period of minutes to overnight. The culture 
medium composition is changed or the flask is sealed to induce hypoxia. In addition, tissue culture 
plate is rotated to induce sheer stress. 
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Human T-cells and/or monocytes are cultured in tissue culture flasks or plates, with LGM-3 
media from Clonetics. Cells are incubated at 37 degree C, 5% C02 and 95% humidity. These 
leukocytes are exposed to the activated or control endothelial layer by adding a suspension of 
leukocytes on to the endothelial cell monolayer. The endothelial cell monolayer is cultured on a tissue 
culture treated plate/ flask or on a microporous membrane. After a variable duration of exposures, the 
endothelial cells and leukocytes "are harvested separately by treating all cells with trypsin and then 
sorting the endothelial cells from the leukocytes by magnetic affinity reagents to an endothelial eel! 
specific marker such as PECAM-1 (Stem Cell Technologies). RNA is extracted from the isolated cells 
by standard techniques. Leukocyte RNA is labeled as described above, and hybridized to leukocyte 
candidate nucleotide library. Epithelial cell RNA is also labeled and hybridized to the leukocyte 
candidate nucleotide library. Alternatively, the epithelial cell RNA is hybridized to a epithelial cell 
candidate nucleotide library, prepared according to the methods described for leukocyte candidate 
libraries, above. 

Hybridization to candidate nucleotide libraries will reveal nucleotide sequences that are up- 
regulated or down-regulated in leukocyte and/or epithelial cells undergoing adhesion. The 
differentially regulated nucleotide sequences are further characterized, e.g. by isolating and sequencing 
the full-length sequence, analysis of the DNA and predicted protein sequence, and functional 
characterization of the protein product of the nucleotide sequence, as described above. Further 
characterization may result in the identification of leukocyte adhesion specific target nucleotide 
sequences, which may be candidate targets for regulation of the inflammatory process. Small molecule 
or antibody inhibitors can be developed to inhibit the target nucleotide sequence function. Such 
inhibitors are tested for their ability to inhibit leukocyte adhesion in the in vitro test described above. 
Integrated systems 

Integrated systems for the collection and analysis of expression profiles, and molecular 
signatures, as well as for the compilation, storage and access of the databases of the invention, typically 
include a digital computer with software including an instruction set for sequence searching and 
analysis, and, optionally, high-throughput liquid control software, image analysis software, data 
interpretation software, a robotic control armature for transferring solutions from a source to a 
destination (such as a detection device) operably linked to the digital computer, an input device (e.g., a 
computer keyboard) for entering subject data to the digital computer, or to control analysis operations 
or high throughput sample transfer by the robotic control armature. Optionally, the integrated system 
further comprises an image scanner for digitizing label signals from labeled assay Components, e.g., 
labeled nucleic acid hybridized to a candidate library microarray. The image scanner can interface with 
image analysis software to provide a measurement of the presence or intensity of the hybridized label, 
i.e., indicative of an on/ofT expression pattern or an increase or decrease in expression. 

Readily available computational hardware resources using standard operating systems are 
fully adequate, e.g., a PC (Intel x86 or Pentium chip- compatible DOS,™ OS2 ,™ WINDOWS,™ 
WINDOWS NT,™ WINDOWS95 ,™ WINDOWS98,™ LINUX, or even Macintosh, Sun or PCs will 
suffice) for use in the integrated systems of the invention. Current art in software technology is 
similarly adequate (i.e., there are a multitude of mature programming languages and source code 
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suppliers) for design, e.g., of an upgradeable open-architecture object-oriented heuristic algorithm, or 
instruction set for expression analysis, as described herein. For example, software for aligning or 
otherwise manipulating ,molecular signatures can be constructed by one of skill using a standard 
programming language such as Visual basic, Fortran, Basic, Java, or the like, according to the methods 
herein. 

Various methods and algorithms, including genetic algorithms and neural networks, can be 
used to perform the data collection, correlation, and storage functions, as well as other desirable 
functions, as described herein. In addition, digital or analog systems such as digital or analog computer 
systems can control a variety of other functions such as the display and/or control of input and output 
files. 

For example, standard desktop applications such as word processing software (e.g., Corel 
WordPerfect™ or Microsoft Word™) and database software (e.g., spreadsheet software such as Corel 
Quattro Pro™, Microsoft Excel™, or database programs such as Microsoft Access™ or Paradox™) 
can be adapted to the present invention by inputting one or more character string corresponding, e.g., to 
an expression pattern or profile, subject medical or historical data, molecular signature, or the like, into 
the software which is loaded into the memory of a digital system, and carrying out the operations 
indicated in an instruction set, e.g., as exemplified in Figure 2. For example, systems can include the 
foregoing software having the appropriate character string information, e.g., used in conjunction with a 
user interface in conjunction with a standard operating system such as a Windows, Macintosh or 
LINUX system. For example, an instruction set for manipulating strings of characters, either by 
programming the required operations into the applications or with the required operations performed 
manually by a user (or both). For example, specialized sequence alignment programs such as P1LEUP 
or BLAST can also be incorporated into the systems of the invention, e.g., for alignment of nucleic 
acids or proteins (or corresponding character strings). 

Software for performing the statistical methods required for the invention, e.g., to detennine 
correlations between expression profiles and subsets of members of the diagnostic nucleotide libraries, 
such as programmed embodiments of the statistical methods described above, are also included in the 
computer systems of the invention. Alternatively, programming elements for performing such methods 
as principle component analysis (PCA) or least squares analysis can also be included in the digital 
system to identify relationships between data. Exemplary software for such methods is provided by 
Partek, Inc., St. Peter, Mo; at the web site partek.com. 

Any controller or computer optionally includes a monitor which can include, e.g., a flat panel 
display (e.g., active matrix liquid crystal display, liquid crystal display), a cathode ray tube ("CRT") 
display, or another display system which serves as a user interface, e.g., to output predictive data. 
Computer circuitry, including numerous integrated circuit chips, such as a microprocessor, memory, 
interface circuits, and the like, is often placed in a casing or box which optionally also includes a hard 
disk drive, a floppy disk drive, a high capacity removable drive such as a writeable CD-ROM, and 
other common peripheral elements. 

Inputting devices such as a keyboard, mouse, or touch sensitive screen, optionally provide for 
input from a user and for user selection, e.g., of sequences or data sets to be compared or otherwise 
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manipulated in the relevant computer system. The computer typically includes appropriate software for 
receiving user instructions, either in the form of user input into a set parameter or data fields (e.g., to 
input relevant subject data), or in the form of preprogrammed instructions, e.g., preprogrammed for a 
variety of different specific operations. The software then converts these instructions to appropriate 
language for instructing the system to carry but any desired operation. 

The integrated system may also be embodied within the circuitry of an application specific 
integrated circuit (ASIC) or programmable logic device (PLD). In such a case, the invention is 
embodied in a computer readable descriptor language that can be used to create an ASIC or PLD. The 
integrated system can also be embodied within the circuitry or logic processors of a variety of other 
digital apparatus, such as PDAs, laptop computer systems, displays, image editing equipment, etc. 

The digital system can comprise a learning component where expression profiles, and relevant 
subject data are compiled and monitored in conjunction with physical assays, and where correlations, 
e.g., molecular signatures with predictive value for a disease, are established or refmed. Successful and 
unsuccessful combinations are optionally documented in a database to provide justification/preferences 
for user-base or digital system based selection of diagnostic nucleotide sets with high predictive 
accuracy for a specified disease or condition. 

The integrated systems can also include an automated workstation. For example, such a 
workstation can prepare and analyze leukocyte RNA samples by performing a sequence of events 
including: preparing RNA from a human blood sample; labeling the RNA with an isotopic or non- 
isotopic label; hybridizing the labeled RNA to at least one array comprising all or part of the candidate 
library; and detecting the hybridization pattern. The hybridization pattern is digitized and recorded in 
the appropriate database. 

Automated RNA preparation too! 

The invention also includes an automated RNA preparation tool for the preparation of 
mononuclear cells from whole blood samples, and preparation of RNA from the mononuclear cells. In 
a preferred embodiment, the use of the RNA preparation tool is fully automated, so that the cell 
separation and RNA isolation would require no human manipulations. Full automation is 
advantageous because it minimizes delay, and standardizes sample preparation across different 
laboratories. This standardization increases the reproducibility of the results. 

Figure 2 depicts the processes performed by the RNA preparation tool of the invention. A 
primary component of the device is a centrifuge (A). Tubes of whole blood containing a density 
gradient solution, transcription/translation inhibitors, and a gel barrier that separates erythrocytes from 
mononuclear cells and serum after centrifugation are placed in the centrifuge (B). The barrier is 
permeable to erythrocytes and granulocytes during centrifugation, but does not allow mononuclear 
cells to pass through (or the barrier substance has a density such that mononuclear cells remain above 
the level of the barrier during the centrifugation ). After centrifugation, the erythrocytes and 
granulocytes are trapped beneath the barrier, facilitating isolation of the mononuclear cell and serum 
layers. A mechanical arm removes the tube and inverts it to mix the mononuclear cell layer and the 
serum (C). The arm next pours the supernatant into a fresh tube (D), while the erythrocytes and 
granulocytes remained below the barrier. Alternatively, a needle is used to aspirate the supernatant and 
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transfer it to a fresh tube. The mechanical arms of the device opens and closes lids, dispenses PBS to 
aid in the collection of the mononuclear cells by centrifugation, and moves the tubes in and out of the 
centrifuge. Following centrifugation, the supernatant is poured off or removed by a vacuum device (E), 
leaving an isolated mononuclear cell pellet. Purification of the RNA from the cells is performed 
automatically, with lysis buffer and other purification solutions (F) automatically dispensed and 
removed before and after centrifugation steps. The result is a purified RNA solution. In another 
embodiment, RNA isolation is performed using a column or filter method. In yet another embodiment, 
the invention includes an on-board homogenizer for use in cell lysis. 
Other automated systems 
Automated and/or semi-automated methods for solid and liquid phase high-throughput sample 
preparation and evaluation are available, and supported by commercially available devices. For 
example, robotic devices for preparation of nucleic acids from bacterial colonies, e.g., to facilitate 
production and characterization of the candidate library include, for example, an automated colony 
picker (e.g., the Q-bot, Genetix, U.K.) capable of identifying, sampling, and inoculating up to 10,000/4 
hrs different clones into 96 well microtiter dishes. Alternatively, or in addition, robotic systems for 
liquid handling are available from a variety of sources, e.g., automated workstations like the automated 
synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic 
systems utilizing robotic arms (Zymate II, Zymark Corporation, Hopkinton, Mass.; Orca, Beckman 
Coulter, Inc. (Fullerton, CA)) which mimic the manual operations performed by a scientist Any of the 
above devices are suitable for use with the present invention, e.g., for high-throughput analysis of 
library components or subject leukocyte samples. The nature and implementation of modifications to 
these devices (if any) so that they can operate as discussed herein will be apparent to persons skilled in 
the relevant art. 

High throughput screening systems that automate entire procedures, e.g., sample and reagent 
pipetting, liquid dispensing, timed incubations, and final readings of the microplate in detectors) 
appropriate for the relevant assay are commercially available. (see t e.g., Zymark Corp., Hopkinton, 
MA; Air Technical Industries, Mentor, OH; Beckman Instruments, Inc. Fullerton, CA; Precision 
Systems, Inc., Natick, MA, etc.). These configurable systems provide high throughput and rapid start 
up as well as a high degree of flexibility and customization. Similarly, arrays and array readers are 
available, e.g., from Affymetrix, PE Biosystems, and others. 

The manufacturers of such systems provide detailed protocols the various high throughput. 
Thus, for example, Zymark Corp. provides technical bulletins describing screening systems for 
detecting the modulation of gene transcription, ligand binding, and the like. 

A variety of commercially available peripheral equipment, including, e.g., optical and 
fluorescent detectors, optical and fluorescent microscopes, plate readers, CCD arrays, 
phosphorimagers, scintillation counters, phototubes, photodiodes, and the like, and software is 
available for digitizing, storing and analyzing a digitized video or digitized optical or other assay 
results, e.g., using PC (Intel x86 or pentium chip- compatible DOS™, OS2™ WINDOWS™, 
WINDOWS NT™ or WINDOWS 9 5™ based machines), MACINTOSH™, or UNIX based (e.g., 
SUN™ work station) computers. 
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Embodiment in a web site. 

The methods described above can be implemented in a localized or distributed computing 
environment. For example, if a localized computing environment is used, an array comprising a 
candidate nucleotide library, or diagnostic nucleotide set, is configured in proximity to a detector, 
which is, in turn, linked to a computational device equipped with user input and output features. 

In a distributed environment, the methods can be implemented on a single computer with 
multiple processors or, alternatively, on multiple computers. The computers can be linked, e.g. through 
a shared bus, but more commonly, the computer(s) are nodes on a network. The network can be 
generalized or dedicated, at a local level or distributed over a wide geographic area. In certain 
embodiments, the computers are components of an intra-net or an internet. 

The predictive data corresponding to subject molecular signatures (e.g., expression profiles, 
and related diagnostic, prognostic, or monitoring results) can be shared by a variety of parties. In 
particular, such information can be utilized by the subject, the subject's health care practitioner or 
provider, a company or other institution, or a scientist. An individual subject's data, a subset of the 
database or the entire database recorded in a computer readable medium can be accessed directly by a 
user by any method of communication, including, but not limited to, the internet. With appropriate 
computational devices, integrated systems, communications networks, users at remote locations, as 
well as users located in proximity to, e.g., at the same physical facility, the database can access the 
recorded information. Optionally, access to the database can be controlled using unique alphanumeric 
passwords that provide access to a subset of the data. Such provisions can be used, e.g., to ensure 
privacy, anonymity, etc. 

Typically, a client (e.g., a patient, practitioner, provider, scientist, or the like) executes a Web 
browser and is linked to a server computer executing a Web server. The Web browser is, for example, 
a program such as IBM's Web Explorer, Internet explorer, NetScape or Mosaic, or the like. The Web 
server is typically, but not necessarily, a program such as IBM's HTTP Daemon or other WWW 
daemon (e.g., LINUX-based forms of the program). The client computer is bi-directionally coupled 
with the server computer over a line or via a wireless system. In turn, the server computer is bi- 
directionally coupled with a website (server hosting the website) providing access to software 
implementing the methods of this invention. 

A user of a client connected to the Intranet or Internet may cause the client to request 
resources that are part of the web site(s) hosting the application(s) providing an implementation of the 
methods described herein. Server program(s) then process the request to return the specified resources 
(assuming they are currently available). A standard naming convention has been adopted, known as a 
Uniform Resource Locator ("URL"). This convention encompasses several types of location names, 
presently including subclasses such as Hypertext Transport Protocol ("http"), File Transport Protocol 
("ftp"), gopher, and Wide Area Information Service ("WAIS"). When a resource is downloaded, it may 
include the URLs of additional resources. Thus, the user of the client can easily learn of the existence 
of new resources that he or she had not specifically requested. 

Methods of implementing Intranet and/or Intranet embodiments of computational and/or data 
access processes are well known to those of skill in the art and are documented, e.g., in ACM Press, pp. 
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383-392; ISO-ANSI, Working Draft, "Information Techno logy- Database Language SQL", Jim 
Melton, Editor, International Organization for Standardization and American National Standards 
Institute, Jul. 1992; ISO Working Draft, "Database Language SQL-Part 2:Foundation 
(SQL/Foundation)", CD9075-2: 199.chi.SQL, Sep. 11, 1997; and Cluer et al. (1992) A General 
Framework for the Optimization of Object-Oriented Queries, Proc SIGMOD International Conference 
on Management of Data, San Diego, California, Jun. 2-5, 1992, SIGMOD Record, vol. 21, Issue 2, 
Jun., 1992; Stonebraker, M., Editor;. Other resources are available, e.g., from Microsoft, IBM, Sun and 
other software development companies. 

Using the tools described above, users of the reagents, methods and database as discovery or 
diagnostic tools can query a centrally located database with expression and subject data. Each 
submission of data adds to the sum of expression and subject information in the database. As data is 
added, a new correlation statistical analysis is automatically run that incorporates the added clinical and 
expression data. Accordingly, the predictive accuracy and the types of correlations of the recorded 
molecular signatures increases as the database grows. 

For example, subjects, such as patients, can access the results of the expression analysis of 
their leukocyte samples and any accrued knowledge regarding the likelihood of the patient's belonging 
to any specified diagnostic (or prognostic, or monitoring, or risk group), i.e., their expression profiles, 
and/or molecular signatures. Optionally, subjects can add to the predictive accuracy of the database by 
providing additional information to the database regarding diagnoses, test results, clinical or other 
related events that have occurred since the time of the expression profiling. Such information can be 
provided to the database via any form of communication, including, but not limited to, the internet. 
Such data can be used to continually define (and redefine) diagnostic groups. For example, if 1000 
patients submit data regarding the occurrence of myocardial infarction over the 5 years since their 
expression profiling, and 300 of these patients report that they have experienced a myocardial 
infarction and 700 report that they have not, then the 300 patients define a new "group A." As the 
algorithm is used to continually query and revise the database, a new diagnostic nucleotide set that 
differentiates groups A and B (i.e., with and without myocardial infarction within a five year period) is 
identified. This newly defined nucleotide set is then be used (in the manner described above) as a test 
that predicts the occurrence of myocardial infarction over a five-year period. While submission 
directly by the patient is exemplified above, any individual with access and authority to submit the 
relevant data e.g., the patient's physician, a laboratory technician, a health care or study administrator, 
or the like, can do so. 

As will be apparent from the above examples, transmission of information via the internet (or 
via an intranet) is optionally bi-directional. That is, for example, data regarding expression profiles, 
subject data, and the like are transmitted via a communication system to the database, while 
information regarding molecular signatures, predictive analysis, and the like, are transmitted from the 
database to the user. For example, using appropriate configurations of an integrated system including a 
microarray comprising a diagnostic nucleotide set, a detector linked to a computational device can 
directly transmit (locally or from a remote workstation at great distance, e.g., hundreds or thousands of 
miles distant from the database) expression profiles and a corresponding individual identifier to a 
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central database for analysis according to the methods of the invention. According to, e.g., the 
algorithms described above, the individual identifier is assigned to one or more diagnostic (or 
prognostic, or monitoring, etc.) categories. The results of this classification are then relayed back, via, 
e.g., the same mode of communication, to a recipient at the same or different internet (or intranet) 
address. 
Kits 

The present invention is optionally provided to a user as a kit. Typically, a kit contains one or 
more diagnostic nucleotide sets of the invention. Alternatively, the kit contains the candidate 
nucleotide library of the invention. Most often, the kit contains a diagnostic nucleotide probe set, or 
other subset of a candidate library, e.g., as a cDNA or antibody microarray packaged in a suitable 
container. The kit may further comprise, one or more additional reagents, e.g., substrates, labels, 
primers, for labeling expression products, tubes and/or other accessories, reagents for collecting blood 
samples, buffers, e.g., erythrocyte lysis buffer, leukocyte lysis buffer, hybridization chambers, cover 
slips, etc., as well as a software package, e.g., including the statistical methods of the invention, e.g., as 
described above, and a password and/or account number for accessing the compiled database. The kit 
optionally further comprises an instruction set or user manual detailing preferred methods of using the 
diagnostic nucleotide sets in the methods of the invention. In one embodiment, the kit may include 
contents useful for the discovery of diagnostic nucleotide sets using microarrays. The kit may include 
sterile, endotoxin and RNAse free blood collection tubes. The kit may also include alcohol swabs, 
tourniquet, blood collection set, and/or PBS (phosphate buffer saline; needed when method of example 
2 is used to derived mononuclear RNA). The kit may also include cell lysis buffer. The kit may 
include RNA isolation kit, substrates for labeling of RNA (may vary for various expression profiling 
techniques). The kit may also include materials for fluorescence microarray expression profiling, 
including one or more of the following: reverse transcriptase and lOx RT buffer, T7(dT)24 primer 
(primer with T7 promoter at 5' end), DTT, deoxynucleotides, optionally lOOmM each, RNAse 
inhibitor, second strand cDNA buffer, DNA polymerase, Rnase H, T7 RNA polymerase 
ribonucleotides, in vitro transcription buffer, and/or Cy3 and Cy5 labeled ribonucleotides. The kit may 
also include microarrays containing candidate gene libraries, cover slips for slides, and/or hybridization 
chambers. The kit may further include software package for identification of diagnostic gene set from 
data, that contains statistical methods, and/or allows alteration in desired sensitivity and specificity of 
gene set. The software may further facilitate access to and data analysis by centrally a located database 
server. The software may further include a password and account number to access central database 
server. In addition, the kit may include a kit user manual. 

In another embodiment, the kit may include contents useful for the application of diagnostic 
nucleotide sets using microarrays. The kit may include sterile, endotoxin and/or RNAse free blood 
collection tubes. The kit may also include, alcohol swabs, tourniquet, and/or a blood collection set. 
The kit may further include PBS (phosphate buffer saline; needed when method of example 2 is used to 
derived mononuclear RNA), cell lysis buffer, and/or an RNA isolation kit. In addition, the kit may 
include substrates for labeling of RNA (may vary for various expression profiling techniques). For 
fluorescence microarray expression profiling, components may include reverse transcriptase and lOx 
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RT buffer, T7(dT)24 primer (primer with T7 promoter at 5* end), DTT, deoxyuucleo tides (optionally 
lOOmM each), RNAse inhibitor, second strand cDNA buffer, DNA polymerase, Rnase H, T7 RNA 
polymerase, ribonucleotides, in vitro transcription buffer, and/or Cy3 and Cy5 labeled ribonucleotides. 
The kit may further include microarrays containing candidate gene libraries. The kit may also include 
cover slips for slides, and/or hybridization chambers. The kit may include a software package for 
identification of diagnostic gene set from data. The software package may contain statistical methods, 
allow alteration in desired sensitivity and specificity of gene set, and/or facilitate access to and data 
analysis by centrally located database server. The software package may include a password and 
account number to access central database server. In addition, the kit may include a kit user manual. 

In another embodiment, the kit may include contents useful for the application of diagnostic 
nucleotide sets using real-time PCR. This kit may include terile, endotoxin and/or RNAse free blood 
collection tubes. The kit may further include alcohol swabs, tourniquet, and/or a blood collection set. 
The kit may also include PBS (phosphate buffer saline; needed when method of example 2 is used to 
derived mononuclear RNA). In addition, the kit may include cell lysis buffer and/or an RNA isolation 
kit. The kit may laso include substrates for real time RT-PCR, which may vary for various real-time 
PCR techniques, including poly dT primers, random hexamer primers, reverse Transcriptase and RT 
buffer, DTT, deoxynucleotides 100 mM, RNase H, primer pairs for diagnostic and control gene set, 
lOx PCR reaction buffer, and/or Taq DNA polymerase. The kit may also include fluorescent probes 
for diagnostic and control gene set (alternatively, fluorescent dye that binds to only double stranded 
DNA). The kit may further include reaction tubes with or without barcode for sample tracking, 96-well 
plates with barcode for sample identification, one barcode for entire set, or individual barcode per 
reaction tube in plate. The kit may also include a software package for identification of diagnostic gene 
set from data, and /or statistical methods. The software package may allow alteration in desired 
sensitivity and specificity of gene set, and/or facilitate access to and data analysis by centrally located 
database server. The kit may include a password and account number to access central database server. 
Finally, the kit may include a kit user manual. 

This invention will be better understood by reference to the following non-limiting Examples: 
LIST OF EXAMPLE TITLES 

Example I: Preparation of a leukocyte cDNA array comprising a candidate gene library 

Example 2: Preparation of RNA from mononuclear cells for expression profiling 

Example 3: Preparation of Universal Control RNA for use in leukocyte expression profiling 

Example 4. RNA Labeling and hybridization to a leukocyte cDNA array of candidate nucleotide 

sequences. 

Example 5: Clinical study for the Identification of diagnostic gene sets useful in diagnosis and 
treatment of Cardiac allograft rejection 

Example 6: Identification of diagnostic nucleotide sets for kidney and liver allograft rejection 
Example 7: Identification of diagnostic nucleotide sets for diagnosis of cytomegalovirus 
Example 8: Design of oligonucleotide probes 

Example 9: Production of an array of 8,000 spotted 50 mer oligonucleotides. 
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Example 10: Identification of diagnostic nucleotide sets for diagnosis of Cardiac Allograft Rejection 
using microarrays 

Example 1 1: Amplification, labeling, and hybridization of total RNA to an oligonucleotide microarray 
Example 12: Real -time PCR validation of array expression results 
Example 13: Real-time PCR expression markers of acute allograft rejection 

Example 14: Identification of diagnostic nucleotide sets for diagnosis of Cardiac Allograft Rejection 
using microarrays 

Example 15: Correlation and Classification Analysis 

Example 16: Acute allograft rejection: biopsy tissue gene expression profiling 

Example 1 7: Microarray and PCR gene expression panels for diagnosis and monitoring of acute 

allograft rejection 

Example 18: Assay sample preparation 

Example 19: Allograft rejection diagnostic gene sequence analysis 
Example 20: Detection of proteins expressed by diagnostic gene sequences 
Example 21: Detecting changes in the rate of hematopoiesis 

Examples 

Example 1: Preparation of a leukocyte cDNA array comprising a candidate gene 
library 

Candidate genes and gene sequences for leukocyte expression profiling are identified through methods 
described elsewhere in this document. Candidate genes are used to obtain or design probes for 
peripheral leukocyte expression profiling in a variety of ways. 

A cDNA microarray carrying 384 probes was constructed using sequences selected from the initial 
candidate library. cDNAs is selected from T-cell libraries, PBMC libraries and buffy coat libraries. 
96- Well PCR 

Plasmids are isolated in 96-well format and PCR was performed in 96-well format. A master mix is 

made that contain the reaction buffer, dNTPs, forward and reverse primer and DNA polymerase was 

made. 99 ul of the master mix was aliquoted into 96-well plate. 1 ul of plasmid (1-2 ng/ul) of plasmid 

was added to the plate. The final reaction concentration was 10 mM Tris pH 8.3, 3.5 mM MgC12, 25 

mM KC1, 0.4 mM dNTPs, 0.4 uM M13 forward primer, 0.4 Ml 3 reverse primer, and 10 U of Taq Gold 

(Applied Biosystems). The PCR conditions were: 

Step 1 95C for 10 min 

Step2 95Cfor 15 sec 

Step 3 56C for 30 sec 

Step 4 72C for 2 min 1 5 seconds 

Step 5 go to Step 2 39 times 

Step 6 72C for 10 minutes 

Step 7 4C for ever. 

PCR Purification 

PCR purification is done in a 96-well format. The Arraylt (Telechem International, Inc.) PCR 
purification kit is used and the provided protocol was followed without modification. Before the 
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sample is evaporated to dryness, the concentration ofPCR products was determined using a 
spectrophotometer. After evaporation, the samples are re-suspended in lx Micro Spotting Solution 
(Arraylt) so that the majority of the samples were between 0.2-1.0 ug/ul. 
Array Fabrication 

Spotted cDNA microarrays are then made from these PCR products by Arraylt using their protocols, 
which may be found at the Arraylt website. Each fragment was spotted 3 times onto each array. 
Candidate genes and gene sequences for leukocyte expression profiling are identified through methods 
described elsewhere in this document. Those candidate genes are used for peripheral leukocyte 
expression profiling. The candidate libraries can used to obtain or design probes for expression 
profiling in a variety of ways. 

Oligonucleotide probes are prepared using the gene sequences of Table 2, Table 8, and the sequence 
listing. Oligo probes are designed on a contract basis by various companies (for example, Compugen, 
Mergen, Affymetrix, Telechem), or designed from the candidate sequences using a variety of 
parameters and algorithms as indicated at located at the MIT web site. Briefly, the length of the 
oligonucleotide to be synthesized is determined, preferably greater than 18 nucleotides, generally 18-24 
nucleotides, 24-70 nucleotides and, in some circumstances, more than 70 nucleotides. The sequence 
analysis algorithms and tools described above are applied to the sequences to mask repetitive elements, 
vector sequences and low complexity sequences. Oligonucleotides are selected that are specific to the 
candidate nucleotide sequence (based on a Blast n search of the oligonucleotide sequence in question 
against gene sequences databases, such as the Human Genome Sequence, UniGene, dbEST or the non- 
redundant database at NCBI), and have <50% G content and 25-70% G+C content. Desired 
oligonucleotides are synthesized using well-known methods and apparatus, or ordered from a company 
(for example Sigma). Oligonucleotides are spotted onto microarrays. Alternatively, oligonucleotides 
are synthesized directly on the array surface, using a variety of techniques (Hughes et al. 2001, 
Yershov et al. 1 996, Lockhart et al 1 996). 

Example 2: Preparation ofRNA from mononuclear cells for exp ression profiling 
Blood was isolated from the subject for leukocyte expression profiling using the following methods: 
Two tubes were drawn per patient. Blood was drawn from either a standard peripheral venous blood 
draw or directly from a large-bore intra-arterial or intravenous catheter inserted in the femoral artery, 
femoral vein, subclavian vein or internal jugular vein. Care was taken to avoid sample contamination 
with heparin from the intravascular catheters, as heparin can interfere with subsequent RNA reactions. 
For each tube, 8 ml of whole blood was drawn into a tube (CPT, Becton-Dickinson order #362753) 
containing the anticoagulant Citrate, 25°C density gradient solution (e.g. Ficoll, Percoll) and a 
polyester gel barrier that upon centrifugation was permeable to RBCs and granulocytes but not to 
mononuclear cells. The tube was inverted several times to mix the blood with the anticoagulant. The 
tubes were centrifuged at 1750xg in a swing-out rotor at room temperature for 20 minutes. The tubes 
were removed from the centrifuge and inverted 5-10 times to mix the plasma with the mononuclear 
cells, while trapping the RBCs and the granulocytes beneath the gel barrier. The plasma/mononuclear 
cell mix was decanted into a 15ml tube and 5ml of phosphate-buffered saline (PBS) is added. The 
1 5ml tubes were spun for 5 minutes at 1750xg to pellet the cells. The supernatant was discarded and 
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1.8 nil of RLT lysis buffer is added to the mononuclear cell pellet. The buffer and cells were pipetted 
up and down to ensure complete lysis of the pellet. The cell lysate was frozen and stored until it is 
convenient to proceed with isolation of total RNA. 

Total RNA was purified from the lysed mononuclear cells using the Qiagen Rneasy Miniprep kit, as 
directed by the manufacturer (10/99 version) for total RNA isolation, including homogenization 
(Qiashredder columns) and on-column DNase treatment. The purified RNA was eluted in 50ul of 
water. The further use of RNA prepared by this method is described in Examples 1 0 and 1 1 . 
Some samples were prepared by a different protocol, as follows: 

Two 8 ml blood samples were drawn from a peripheral vein into a tube (CPT, Bee ton-Dickinson order 
#362753) containing anticoagulant (Citrate), 25°C density gradient solution (Ficoll) and a polyester gel 
barrier that upon centrifugation is permeable to RBCs and granulocytes but not to mononuclear cells. 
The mononuclear cells and plasma remained above the barrier while the RBCs and granulocytes were 
trapped below. The tube was inverted several times to mix the blood with the anticoagulant, and the 
tubes were subjected to centrifugation at 1750xg in a swing-out rotor at room temperature for 20 min. 
The tubes were removed from the centrifuge, and the clear plasma layer above the cloudy mononuclear 
cell layer was aspirated and discarded. The cloudy mononuclear cell layer was aspirated, with care 
taken to rinse all of the mononuclear cells from the surface of the gel barrier with PBS (phosphate 
buffered saline). Approximately 2 mis of mononuclear cell suspension was transferred to a 2ml 
microcentrifuge tube, and centrifuged for 3min. at 16,000 rpm in a microcentrifuge to pellet the cells. 
The supernatant was discarded and 1.8 ml of RLT lysis buffer (Qiagen) were added to the mononuclear 
cell pellet, which lysed the cells and inactivated Rnases. The cells and lysis buffer were pipetted up 
and down to ensure complete lysis of the pellet. Cell lysate was frozen and stored until it was 
convenient to proceed with isolation of total RNA. 

RNA samples were isolated from 8 mL of whole blood. Yields ranged from 2 ug to 20ug total RNA 
for 8mL blood. A260/A280 spectrophotometric ratios were between 1.6 and 2.0, indicating purity of 
sample. 2ul of each sample were run on an agarose gel in the presence of ethidium bromide. No 
degradation of the RNA sample and no DNA contamination was visible. 

In some cases, specific subsets of mononuclear cells were isolated from peripheral blood of human 
subjects. When this was done, the StemSep cell separation kits (manual version 6.0.0) were used from 
StemCell Technologies (Vancouver, Canada). This same protocol can be applied to the isolation of T 
cells, CD4 T cells, CD8 T cells, B cells, monocytes, NK cells and other cells. Isolation of cell types 
using negative selection with antibodies may be desirable to avoid activation of target cells by 
antibodies. 

Example 3: Preparation of Universal Control RNA for use in leukocyte expression profiling 
Control RNA was prepared using total RNA from Buffy coats and/or total RNA from enriched 
mononuclear cells isolated from Buffy coats, both with and without stimulation with ionomycin and 
PMA. The following control RNAs were prepared: 
Control 1 : Buffy Coat Total RNA 
Control 2: Mononuclear cell Total RNA 
Control 3: Stimulated buffy coat Total RNA 
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Control 4: Stimulated mononuclear Total RNA 

Control 5: 50% Buffy coat Total RNA / 50% Stimulated buffy coat Total RNA 

Control 6: 50% Mononuclear cell Total RNA / 50% Stimulated Mononuclear Total RNA 

Some samples were prepared using the following protocol: Buffy coats from 38 individuals were 

obtained from Stanford Blood Center. Each buffy coat is derived from -350 mL whole blood from 

one individual. 10 ml buffy coat was removed from the bag, and placed into a 50 ml tube. 40 ml of 

Buffer EL (Qiagen) was added, the tube was mixed and placed on ice for 15 minutes, then cells were 

pelleted by cenrrifugation at 2000xg for 10 minutes at 4°C. The supernatant was decanted and the cell 

pellet was re-suspended in 10 ml of Qiagen Buffer EL. The tube was then centrifuged at 2000xg for 10 

minutes at 4°C. The cell pellet was then re-suspended in 20 ml TRIZOL (GibcoBRL) per Buffy coat 

sample, the mixture was shredded using a rotary homogenizer, and the lysate was then frozen at -80°C 

prior to proceeding to RNA isolation. 

Other control RNAs were prepared from enriched mononuclear cells prepared from Buffy coats. Buffy 
coats from Stanford Blood Center were obtained, as described above. 10 ml buffy coat was added to a 
50 ml polypropylene tube, and 10 ml of phosphate buffer saline (PBS) was added to each tube. A 
polysucrose (5.7 g/dL) and sodium diatrizoate (9.0 g/dL) solution at a 1 .077 +/-0.0001 g/ml density 
solution of equal volume to diluted sample was prepared (Histopaque 1077, Sigma cat. no 1077-1). 
This and all subsequent steps were performed at room temperature. 15 ml of diluted buffy coat/PBS 
was layered on top of 15 ml of the histopaque solution in a 50 ml tube. The tube was centrifuged at 
400xg for 30 minutes at room temperature. After cenrrifugation, the upper layer of the solution to 
within 0.5 cm of the opaque interface containing the mononuclear cells was discarded. The opaque 
interface was transferred into a clean centrifuge tube. An equal volume of PBS was added to each tube 
and centrifuged at 350xg for 10 minutes at room temperature. The supernatant was discarded. 5 ml of 
Buffer EL (Qiagen) was used to resuspend the remaining cell pellet and the tube was centrifuged at 
2000xg for 10 minutes at room temperature. The supernatant was discarded. The pellet was 
resuspended in 20 ml of TRIZOL (GibcoBRL) for each individual buffy coat that was processed. The 
sample was homogenized using a rotary homogenizer and frozen at -80C until RNA was isolated. 
RNA was isolated from frozen lysed Buffy coat samples as follows: frozen samples were thawed, and 
4 ml of chloroform was added to each buffy coat sample. The sample was mixed by vortexing and 
centrifuged at 2000xg for 5 minutes. The aqueous layer was moved to new tube and then repurified by 
using the RNeasy Maxi RNA clean up kit, according to the manufacturer's instruction (Qiagen, PN 
75 162). The yield, purity and integrity were assessed by spectrophotometer and gel electrophoresis. 
Some samples were prepared by a different protocol, as follows. The further use of RNA prepared 
using this protocol is described in Example 11. 

50 whole blood samples were randomly selected from consented blood donors at the Stanford Medical 
School Blood Center. Each buffy coat sample was produced from -350 mL of an individual's donated 
blood. The whole blood sample was centrifuged at -4,400 x g for 8 minutes at room temperature, 
resulting in three distinct layers: a top layer of plasma, a second layer of buffy coat, and a third layer of 
red blood cells. 25 ml of the buffy coat fraction was obtained and diluted with an equal volume of PBS 
(phosphate buffered saline). 30 ml of diluted buffy coat was layered onto 15 ml of sodium diatrizoate 
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solution adjusted to a density of 1.077+/-0.001 g/ml (Histopaque 1077, Sigma) in a 50mL plastic tube. 
The tube was spun at 800 g for 10 minutes at room temperature. The plasma layer was removed to the 
30 ml mark on the tube, and the mononuclear cell layer removed into a new tube and washed with an 
equal volume of PBS, and collected by centrifugation at 2000 g for 10 minutes at room temperature. 
The cell pellet was resuspended in 10 ml of Buffer EL (Qiagen) by vortexing and incubated on ice for 
10 minutes to remove any remaining erthythrocytes. The mononuclear cells were spun at 2000 g for 10 
minutes at 4 degrees Celsius. The cell pellet was lysed in 25 ml of a phenol/guanidinium thiocyanate 
solution (TRIZOL Reagent, Invitrogen). The sample was homogenized using a PowerGene 5 rotary 
homogenizer (Fisher Scientific) and Omini disposable generator probes (Fisher Scientific). The Trizol 
lysate was frozen at -80 degrees C until the next step. 

The samples were thawed out and incubated at room temperature for 5 minutes. 5 ml chloroform was 
added to each sample, mixed by vortexing, and incubated at room temperature for 3 minutes. The 
aqueous layers were transferred to new 50 ml tubes. The aqueous layer containing total RNA was 
further purified using the Qiagen RNeasy Maxi kit (PN 75162), per the manufacturer's protocol 
(October 1999). The columns were eluted twice with 1 ml Rnase-free water, with a minute incubation 
before each spin. Quantity and quality of RNA was assessed using standard methods. Generally, RNA 
was isolated from batches of 1 0 bufry coats at a time, with an average yield per buffy coat of 870 ug, 
and an estimated total yield of 43.5 mg total RNA with a 260/280 ratio of 1.56 and a 28S/18S ratio of 
1.78. 

Quality of the RNA was tested using the Agilent 2100 Bioanalyzer using RNA 6000 microfluidics 
chips. Analysis of the electrophorgrams from the Bioanalyzer for five different batches demonstrated 
the reproducibility in quality between the batches. 

Total RNA from all five batches were combined and mixed in a 50 ml tube, then aliquoted as follows: 
2 x 10 ml aliquots in 15 ml tubes, and the rest in 100 ul aliquots in 1 .5 ml microcentrifuge tubes. The 
aliquots gave highly reproducible results with respect to RNA purity, size and integrity. The RNA was 
stored at -80°C. 

Test hybridization of Reference RNA. 

When compared with BC38 and Stimulated mononuclear reference samples, the R50 performed as 
well, if not better than the other reference samples as shown in Figure 3. In an analysis of 
hybridizations, where the R50 targets were fluorescently labeled with Cy-5 using methods described 
herein and the amplified and labeled aRNA was hybridized (as in example 1 1) to the olignoucleotide 
array described in example 9. The R50 detected 97.3% of probes with a Signal to Noise ratio (S/N) of 
greater than three and 99.9 % of probes with S/N greater than one. 

Example 4. RNA Labeling and hybridization to a leukocyte cDNA array of candidate nucleotide 
sequences. 

Comparison of Guanine-Silica to Acid-Phenol RNA Purification (GSvsAP) 
These data are from a set of 12 hybridizations designed to identify differences between the signal 
strength from two different RNA purification methods. The two RNA methods used were guanidine- 
silica (GS, Qiagen) and acid-phenol (AP, Trizol, Gibco BRL). Ten tubes of blood were drawn from 
each of four people. Two were used for the AP prep, the other eight were used for the GS prep. The 
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protocols for the leukocyte RNA preps using the AP and GS techniques were completed as described 
here: 

Guanidine-silica (GS) method: 

For each tube, 8ml blood was drawn into a tube containing the anticoagulant Citrate, 25°C density 
gradient solution and a polyester gel barrier that upon centrifugation is permeable to RBCs and 
granulocytes but not to mononuclear cells. The mononuclear cells and plasma remained above the 
barrier while the RBCs and granulocytes were trapped below. CPT tubes from Becton-Dickinson 
(#362753) were used for this purpose. The tube was inverted several times to mix the blood with the 
anticoagulant. The tubes were immediately centrifuged @1750xg in a swinging bucket rotor at room 
temperature for 20 min. The tubes were removed from the centrifuge and inverted 5-10 times. This 
mixed the plasma with the mononuclear cells, while the RBCs and the granulocytes remained trapped 
beneath the gel barrier. The plasma/mononuclear cell mix was decanted into a 15ml tube and 5ml of 
phosphate-buffered saline (PBS) was added. The 15ml tubes are spun for 5 minutes at 1750xg to pellet 
the cells. The supernatant was discarded and 1.8 ml of RLT lysis buffer (guanidine isothyocyanate) 
was added to the mononuclear cell pellet. The buffer and cells were pipetted up and down to ensure 
complete lysis of the pellet. The cell lysate was then processed exactly as described in the Qiagen 
Rneasy Miniprep kit protocol (10/99 version) for total RNA isolation (including steps for 
homogenization (Qiashredder columns) and on-column DNase treatment. The purified RNA was 
eluted in 50ul of water. 
Acid-phenol (AP) method: 

For each tube, 8ml blood was drawn into a tube containing the anticoagulant Citrate, 25°C density 
gradient solution and a polyester gel barrier that upon centrifugation is permeable to RBCs and 
granulocytes but not to mononuclear cells. The mononuclear cells and plasma remained above the 
barrier while the RBCs and granulocytes were trapped below. CPT tubes from Becton-Dickinson 
(#362753) were used for this purpose. The tube was inverted several times to mix the blood with the 
anticoagulant. The tubes were immediately centrifuged @1750xg in a swinging bucket rotor at room 
temperature for 20 min. The tubes were removed from me centrifuge arid inverted 5-10 times. This 
mixed the plasma with the mononuclear cells, while the RBCs and the granulocytes remained trapped 
beneath the gel barrier. The plasma/mononuclear cell mix was decanted into a 15ml tube and 5ml of 
phosphate-buffered saline (PBS) was added. The 15ml tubes are spun for 5 minutes @1750xg to pellet 
the cells. The supernatant was discarded and the cell pellet was lysed using 0.6 mL Phenol/guanidine 
isothyocyanate (e.g. Trizol reagent, GibcoBRL). Subsequent total RNA isolation proceeded using the 
manufacturers protocol. 

RNA from each person was labeled with either Cy3 or Cy5, and then hybridized in pairs to the mini- 
array. For instance, the first array was hybridized with GS RNA from one person (Cy3) and GS RNA 
from a second person (Cy5). 

Techniques for labeling and hybridization for all experiments discussed here were completed as 
detailed above. Arrays were prepared as described in example 1. 

RNA isolated from subject samples, or control Buffy coat RNA, were labeled for hybridization to a 
cDNA array. Total RNA (up to 100 jig) was combined with 2 /xl of 100 /*M solution of an Oligo 
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(dT)12-!8 (GibcoBRL) and heated to 70°C for 10 minutes and place on ice. Reaction buffer was 
added to the tube, to a final concentration of IxRT buffer (GibcoBRL), 10 mM DTT (GibcoBRL), 0.1 
mM unlabeled dATP, dTTP, and dGTP, and 0.025 mM unlabeled dCTP, 200 pg of CAB (A. thaliana 
photosystem I chlorophyll a/b binding protein), 200 pg of RCA (A. thaliana RUBISCO activase), 0.25 
mM of Cy-3 or Cy-5 dCTP, and 400 U Superscript II RT (GibcoBRL). 

The volumes of each component of the labeling reaction were as follows: 20 ul of 5xRT buffer; 10 ul 
of 100 mM DTT;J ul of 10 mM dNTPs without dCTP; 0.5 ul of 5 mM CTP; 13 ul of H20; 0.02 ul of 
10 ng/ul CAB and RCA; I ul of 40 Units/Ml RNAseOUT Recombinant Ribonuclease Inhibitor 
(GibcoBRL); 2.5 ul of 1.0 mM Cy-3 or Cy-5 dCTP; and 2.0 ul of 200 Units/ul of Superscript II RT. 
The sample was vortexed and centrifuged. The sample was incubated at 4°C for 1 hour for first strand 
cDNA synthesis, then heated at 70°C for 10 minutes to quench enzymatic activity. 1 /il of 10 mg/ml of 
Rnase A was added to degrade the RNA strand, and the sample was incubated at 37°C for 30 minutes. 
Next, the Cy-3 and Cy-5 cDN A samples were combined into one tube. Unincorporated nucleotides 
were removed using QIAquick RCR purification protocol (Qiagen), as directed by the manufacturer. 
The sample was evaporated to dryness and resuspended in 5 /il of water. The sample was mixed with 
hybridization buffer containing 5xSSC, 0.2% SDS, 2 mg/ml Cot-1 DNA (GibcoBRL), 1 mg/ml yeast 
tRNA (GibcoBRL), and 1.6 ng/jd poly dA40-60 (Pharmacia). This mixture was placed on the 
microarray surface and a glass cover slip was placed on the array (Corning). The microarray glass slide 
was placed into a hybridization chamber (Arrraylt). The chamber was then submerged in a water bath 
overnight at 62° C. The microarray was removed from the cassette and the cover slip was removed by 
repeatedly submerging it to a wash buffer containing lxSSC, and 0.1% SDS. The microarray slide was 
washed in lxSSC/0.1% SDS for 5 minutes. The slide was then washed in 0.1%SSC/0.1% SDS for 5 
minutes. The slide was finally washed in O.lxSSC for 2 minutes. The slide was spun at 1000 rpm for 
2 minutes to dry out the slide, then scanned on a microarray scanner (Axon Instruments, Union City, 
CA.). 

Six hybridizations with 20 jig of RNA were performed for each type of RNA preparation (GS or AP). 
Since both the Cy3 and the Cy5 labeled RNA are from test preparations, there are six data points for 
each GS prepped, Cy3-labeled RNA and six for each GS-prepped, Cy5-labeled RNA. The mini array 
hybridizations were scanned on and Axon Instruments scanner using GenPix 3.0 software. The data 
presented were derived as follows. First, all features flagged as "not found" by the software were 
removed from the dataset for individual hybridizations. These features are usually due to high local 
background or other processing artifacts. Second, the median fluorescence intensity minus the 
background fluorescence intensity was used to calculate the mean background subtracted signal for 
each dye for each hybridization. In Figure 3, the mean of these means across all six hybridizations is 
graphed (n=6 for each column). The error bars are the SEM. This experiment shows that the average 
signal from AP prepared RNA is 47% of the average signal from GS prepared RNA for both Cy3 and 
Cy5. 

Generation of expression data for leukocyte genes from peripheral leukocyte samples 

Six hybridizations were performed with RNA purified from human blood leukocytes using the 

protocols given above. Four of the six were prepared using the GS method and 2 were prepared using 
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the AP method. Each preparation of leukocyte RNA was labeled with Cy3 and 10 /ig hybridized to the 
mini-array. A control RNA was batch labeled with Cy5 and 10 hybridized to each mini-array 
together with the Cy3-labeled experimental RNA. 

The control RNA used for these experiments was Control 1 : Buffy Coat RNA, as described above. 
The protocol for the preparation of that RNA is reproduced here: 
Buffy Coat RNA Isolation: 

Buffy coats were obtained from Stanford Blood Center (in total 38 individual buffy coats were used. 
Each buffy coat is derived from -350 mL whole blood from one individual. 10 ml buffy coat was 
taken and placed into a 50 ml tube and 40 ml of a hypoclorous acid (HOC1) solution (Buffer EL from 
Qiagen) was added. The tube was mixed and placed on ice for 1 5 minutes. The tube was then 
centrifuged at 2000xg for 10 minutes at 4°C. The supernatant was decanted and the cell pellet was re- 
suspended in 10 ml of hypochlorous acid solution (Qiagen Buffer EL). The tube was then centrifuged 
at 2000xg for 10 minutes at 4°C. The cell pellet was then re-suspended in 20 ml phenol/guanidine 
thiocyanate solution ( TRIZOL from GibcoBRL) for each individual buffy coat that was processed. 
The mixture was then shredded using a rotary homogenizer. The lysate was then frozen at -80°C prior 
to proceeding to RNA isolation. 

The arrays were then scanned and analyzed on an Axon Instruments scanner using GenePix 3.0 
software. The data presented were derived as follows. First, all features flagged as "not found" by the 
software were removed from the dataset for individual hybridizations. Second, control features were 
used to normalize the data for labeling and hybridization variability within the experiment. The control 
features are cDNA for genes from the plant, Arabidopsis thaliana, that were included when spotting the 
mini-array. Equal amounts of RNA complementary to two of these cDNAs were added to each of the 
samples before they were labeled. A third was pre-labeled and equal amounts were added to each 
hybridization solution before hybridization. Using the signal from these genes, we derived a 
normalization constant (LJ) according to the following formula: 

£ BOSS 

1=1 

Lj = ^ 

K XSGSS,,. 

z- 

K 

where BGSS; is the signal for a specific feature as identified in the GenePix software as the median 
background subtracted signal for that feature, N is the number of A. thaliana control features, K is the 
number of hybridizations, and L is the normalization constant for each individual hybridization. 
Using the formula above, the mean over all control features of a particular hybridization and dye (eg 
Cy3) was calculated. Then these control feature means for all Cy3 hybridizations were averaged. The 
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control feature mean in one hybridization divided by the average of all hybridizations gives a 
normalization constant for that particular Cy3 hybridization. 

The same normalization steps were performed for Cy3 and Cy5 values, both fluorescence and 
background. Once normalized, the background Cy3 fluorescence was subtracted from the Cy3 
fluorescence for each feature. Values less than 100 were eliminated from further calculations since low 
values caused spurious results. 

Figure 4 shows the average background subtracted signal for each of nine leukocyte-specific genes on 
the mini array. This average is for 3-6 of the above-described hybridizations for each gene. The error 
bars are the SEM. 

The ratio of Cy3 to Cy5 signal is shown for a number of genes. This ratio corrects for variability 
among hybridizations and allows comparison between experiments done at different times. The ratio is 
calculated as the Cy3 background subtracted signal divided by the Cy5 background subtracted signal. 
Each bar is the average for 3-6 hybridizations. The error bars are SEM. 

Together, these results show that we can measure expression levels for genes that are expressed 
specifically in sub-populations of leukocytes. These expression measurements were made with only 10 
/ig of leukocyte total RNA that was labeled directly by reverse transcription. The signal strength can 
be increased by improved labeling techniques that amplify either the starting RNA or the signal 
fluorescence. In addition, scanning techniques with higher sensitivity can be used. 
Genes in Figures 4 and 5: 



Gene Name/Description 


GenBank Accession 
Number 


Gene Name 
Abbreviatioi 


T cell-specific tyrosine kinase Mrna 


L10717 


TKTCS 


Interleukin 1 alpha (IL 1) mRNA, complete cds 


NM_000575 


ILIA 


T-cell surface antigen CD2 (Tl 1) mRNA, complete cds 


M 14362 


CD2 


Interleukin- 13 (IL-13) precursor gene, complete cds 


U31120 


IL-13 


Thymocyte antigen CD la mRNA, complete cds 


M28825 


CDla 


CD6 mRNA for T cell glycoprotein CDS 


NMJ506725 


CD6 


MHC class n HLA-DQA1 mRNA, complete cds 


U77589 


HLA-DQA1 


Granulocyte colony-stimulating factor 


M28170 


CD19 


Homo sapiens CD69 antigen 


NM 001781 


CD69 



Example 5: Clinical study to identify diagnostic zene sets useful in diagno sis and treatment of cardiac 
allograft recipients 

An observational study was conducted in which a prospective cohort of cardiac transplant recipients 
were analyzed for associations between clinical events or rejection grades and expression of a 
leukocyte candidate nucleotide sequence library. Patients were identified at 4 cardiac transplantation 
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centers while on the transplant waiting list or during their routing post-transplant care. All adult 
cardiac transplant recipients (new or re-transplants) who received an organ at the study center during 
the study period or within 3 months of the start of the study period were eligible. The first year after 
transplantation is the time when most acute rejection occurs and it is thus important to study patients 
during this period. Patients provided informed consent prior to study procedures. 
Peripheral blood leukocyte samples were obtained from all patients at the following time points: prior 
to transplant surgery (when able), the same day as routinely scheduled screening biopsies, upon 
evaluation for suspected acute rejection (urgent biopsies), on hospitalization for an acute complication 
of transplantation or immunosuppression, and when Cytomegalovirus (CMV) infection was suspected 
or confirmed. Samples were obtained through a standard peripheral vein blood draw or through a 
catheter placed for patient care (for example, a central venous catheter placed for endocardial biopsy). 
When blood was drawn from a intravenous line, care was taken to avoid obtaining heparin with the 
sample as it can interfere with downstream reactions involving the RNA. Mononuclear cells were 
prepared from whole blood samples as described in Example 2. Samples were processed within 2 
hours of the blood draw and DNA and serum were saved in addition to RNA. Samples were stored at 
-80° C or on dry ice and sent to the site of RNA preparation in a sealed container with ample dry ice. 
RNA was isolated from subject samples as described in Example 2 and hybridized to a candidate 
library of differentially expressed leukocyte nucleotide sequences, as further described in Examples 9- 
10. Methods used for amplification, labeling, hybridization and scanning are described in Example 1 1 . 
Analysis of human transplant patient mononuclear cell RNA hybridized to a microarray and 
identification of diagnostic gene sets is shown in Example 10. 

From each patient, clinical information was obtained at the following time points: prior to transplant 
surgery (when available), the same day as routinely scheduled screening biopsies, upon evaluation for 
suspected acute rejection (e.g., urgent biopsies), on hospitalization for an acute complication of 
transplantation or immunosuppression, and when Cytomegalovirus (CMV) infection was suspected or 
confirmed. Data was collected directly from the patient, from the patient's medical record, from 
diagnostic test reports or from computerized hospital databases. It was important to collect all 
information pertaining to the study clinical correlates (diagnoses and patient events and states to which 
expression data is correlated) and confounding variables (diagnoses and patient events and states that 
may result in altered leukocyte gene expression. Examples of clinical data collected are: patient sex, 
date of birth, date of transplant, race, requirement for prospective cross match, occurrence of pre- 
transplant diagnoses and complications, indication for transplantation, severity and type of heart 
disease, history of left ventricular assist devices, all known medical diagnoses, blood type, HLA type, 
viral serologies (including CMV, Hepatitis B and C, HIV and others), serum chemistries, white and red 
blood cell counts and differentials, CMV infections (clinical manifestations and methods of diagnosis), 
occurrence of new cancer, hemodynamic parameters measured by catheterization of the right or left 
heart (measures of graft function), results of echocardiography, results of coronary angiograms, results 
of intravascular ultrasound studies (diagnosis of transplant vasculopathy), medications, changes in 
medications, treatments for rejection, and medication levels. Information was also collected regarding 
the organ donor, including demographics, blood type, HLA type, results of screening cultures, results 
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of viral serologies, primary cause of brain death, the need for inotropic support, and the organ cold 
ischemia time. 

Of great importance was the collection of the results of endocardial biopsy for each of the patients at 
each visit. Biopsy results were all interpreted and recorded using the international society for heart and 
lung transplantation (ISHLT) criteria, described below. Biopsy pathological grades were determined 
by experienced pathologists at each center. 



ISHLT Criteria 



Grade 


Finding 


Rejection 
Severity 


0 


No lymphocytic infiltrates 


None 


1A 


Focal (perivascular or interstitial lymphocytic infiltrates without 
necrosis) 


Borderline 
mild 


IB 


Diffuse but sparse lymphocytic infiltrates without necrosis 


Mild 


2 


One focus only with aggressive lymphocytic infiltrate and/or 
myocyte damage 


Mild, focal 
moderate 


3A 


Multifocal aggressive lymphocytic infiltrates and/or myocardial 
damage 


Moderate 


3B 


Diffuse inflammatory lymphocytic infiltrates with necrosis 


Borderline 
Severe 


4 


Diffuse aggressive polymorphous lymphocytic infiltrates with 
edema hemorrhage and vasculitis, with necrosis 


Severe 



Because variability exists in the assignment of ISHLT grades, it was important to have a centralized 
and blinded reading of the biopsy slides by a single pathologist. This was arranged for all biopsy slides 
associated with samples in the analysis. Slides were obtained and assigned an encoded number. A 
single pathologist then read all slides from all centers and assigned an ISHLT grade. Grades from the 
single pathologist were then compared to the original grades derived from the pathologists at the study 
centers. For the purposes of correlation analysis of leukocyte gene expression to biopsy grades, the 
centralized reading information was used in a variety of ways (see Example 10 for more detail). In 
some analyses, only the original reading was used as an outcome. In other analyses, the result from the 
centralized reader was used as an outcome. In other analyses, the highest of the 2 grades was used. 
For example, if the original assigned grade was 0 and the centralized reader assigned a 1A, then 1 A 
was the grade used as an outcome. In some analyses, the highest grade was used and then samples 
associated with a Grade 1 A reading were excluded from the analysis. In some analyses, only grades 
with no disagreement between the 2 readings were used as outcomes for correlation analysis. 
Clinical data was entered and stored in a database. The database was queried to identify all patients 
and patient visits that meet desired criteria (for example, patients with > grade II biopsy results, no 
CMV infection and time since transplant < 12 weeks). 



122 



WO 2004/042346 



PCT/US2003/012946 



The collected clinical data (disease criteria) is used to define patient or sample groups for correlation of 
expression data. Patient groups are identified for comparison, for example, a patient group that 
possesses a useful or interesting clinical distinction, versus a patient group that does not possess the 
distinction. Examples of useful and interesting patient distinctions that can be made on the basis of 
collected clinical data are listed here: 

1 . Rejection episode of at least moderate histologic grade, which results in treatment of 
the patient with additional corticosteroids, anti-T cell antibodies, or total lymphoid irradiation. 

2. Rejection with histologic grade 2 or higher. 

3. Rejection with histologic grade <2. 

4. The absence of histologic rejection and normal or unchanged allograft function 
(based on hemodynamic measurements from catheterization or on echocardiography data). 

5. The presence of severe allograft dysfunction or worsening allograft dysfunction 
during the study period (based on hemodynamic measurements from catheterization or on 
echocardiographic data). 

6. Documented CMV infection by culture, histology, or PCR, and at least one clinical 
sign or symptom of infection. 

7. Specific graft biopsy rejection grades 

8. Rejection of mild to moderate histologic severity prompting augmentation of the 
patient's chronic immunosuppressive regimen 

9. Rejection of mild to moderate severity with allograft dysfunction prompting 
plasmapheresis or a diagnosis of "humoral" rejection 

10. Infections other than CMV, esp. Epstein Barr virus (EBV) 

1 1 . Lymphoproliferative disorder (also called, post-transplant lymphoma) 

12. Transplant vasculopathy diagnosed by increased intimal thickness on intravascular 
ultrasound (IVUS), angiography, or acute myocardial infarction. 

1 3 . Graft Failure or Retransplantation 

14. All cause mortality 

15. Grade 1 A or higher rejection as defined by the initial biopsy reading. 

16. Grade IB or higher rejection as defined by the initial biopsy reading. 

17. Grade 1 A or higher rejection as defined by the centralized biopsy reading. 

1 8. Grade IB or higher rejection as defined by the centralized biopsy reading. 

19. Grade 1A or higher rejection as defined by the highest of the initial and centralized 
biopsy reading. 

20. Grade IB or higher rejection as defined by the highest of the initial and centralized 
biopsy reading. 

21 . Any rejection > Grade 2 occurring in patient at any time in the post-transplant course. 
Expression profiles of subject samples are examined to discover sets of nucleotide sequences with 
differential expression between patient groups, for example, by methods describes above and below. 
Non-limiting examples of patient leukocyte samples to obtain for discovery of various diagnostic 
nucleotide sets are as follows: 
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Leukocyte set to avoid biopsy or select for biopsy: 
Samples : Grade 0 vs. Grades 1-4 

Leukocyte set to monitor therapeutic response: 
Examine successful vs. unsuccessful drug treatment. 

Samples: 

Successful: Time I : rejection, Time 2: drug therapy Time 3: no rejection 
Unsuccessful: Time 1: rejection, Time 2: drug therapy; Time 3: rejection 

Leukocyte set to predict subsequent acute rejection. 

Biopsy may show no rejection, but the patient may develop rejection shortly thereafter. Look at 
profiles of patients who subsequently do and do not develop rejection. 

Samples: 

Group 1 (Subsequent rejection): Time 1: Grade 0; Time 2: Grade>0 
Group 2 (No subsequent rejection): Time 1 : Grade 0; Time 2: Grade 0 

Focal rejection may be missed by biopsy. When this occurs the patient may have a Grade 0, but 
actually has rejection. These patients may go on to have damage to the graft etc. 

Samples: 

Non-rejectors: no rejection over some period of time 
Rejectors: an episode of rejection over same period 

Leukocyte set to diagnose subsequent or current graft failure: 

Samples: 

Echocardiographic or catheterization data to define worsening function over time and correlate to 
profiles. 

Leukocyte set to diagnose impending active CMV: 
Samples: 

Look at patients who are CMV IgG positive. Compare patients with subsequent (to a sample) clinical 
CMV infection verses no subsequent clinical CMV infection. 

Leukocyte set to diagnose current active CMV: 

Samples: 

Analyze patients who are CMV IgG positive. Compare patients with active current clinical CMV 
infection vs. no active current CMV infection. 

Upon identification of a nucleotide sequence or set of nucleotide sequences that distinguish patient 
groups with a high degree of accuracy, that nucleotide sequence or set of nucleotide sequences is 
validated, and implemented as a diagnostic test. The use of the test depends on the patient groups that 
are used to discover the nucleotide set. For example, if a set of nucleotide sequences is discovered that 
have collective expression behavior that reliably distinguishes patients with no histological rejection or 
graft dysfunction from all others, a diagnostic is developed that is used to screen patients for the need 
for biopsy. Patients identified as having no rejection do not need biopsy, while others are subjected to 
a biopsy to further define the extent of disease. In another example, a diagnostic nucleotide set that 
determines continuing graft rejection associated with myocyte necrosis (> grade I) is used to determine 
that a patient is not receiving adequate treatment under the current treatment regimen. After increased 
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or altered immunosuppressive therapy, diagnostic profiling is conducted to determine whether 
continuing graft rejection is progressing. In yet another example, a diagnostic nucleotide set(s) that 
determine a patient's rejection status and diagnose cytomegalovirus infection is used to balance 
immunosuppressive and anti-viral therapy. 

The methods of this example are also applicable to cardiac xenograft monitoring. 

Example 6: Identification of diagnostic nucleotide sets for kidney and liver allograft rejection 
Diagnostic tests for rejection are identified using patient leukocyte expression profiles to identify a 
molecular signature correlated with rejection of a transplanted kidney or liver. Blood, or other 
leukocyte source, samples are obtained from patients undergoing kidney or liver biopsy following liver 
or kidney transplantation, respectively. Such results reveal the histological grade, i.e., the state and 
severity of allograft rejection. Expression profiles are obtained from the samples as described above, 
and the expression profile is correlated with biopsy results. In the case of kidney rejection, clinical data 
is collected corresponding to urine output, level of creatine clearance, and level of serum creatine (and 
other markers of renal function). Clinical data collected for monitoring liver transplant rejection 
includes, biochemical characterization of serum markers of liver damage and function such as SGOT, 
SGPT, Alkaline phosphatase, GGT, Bilirubin, Albumin and Prothrombin time. 

Leukocyte nucleotide sequence expression profiles are collected and correlated with important clinical 
states and outcomes in renal or hepatic transplantation. Examples of useful clinical correlates are given 
here: 

1. Rejection episode of at least moderate histologic grade, which results in treatment of the 
patient with additional corticosteriods, anti-T cell antibodies, or total lymphoid irradiation. 

2. The absence of histologic rejection and normal or unchanged allograft function (based on tests 
of renal or liver function listed above). 

3. The presence of severe allograft dysfunction or worsening allograft dysfunction during the 
study period (based on tests of renal and hepatic function listed above). 

4. Documented CMV infection by culture, histology, or PCR, and at least one clinical sign or 
symptom of infection. 

5. Specific graft biopsy rejection grades 

6. Rejection of mild to moderate histologic severity prompting augmentation of the patient's 
chronic immunosuppressive regimen 

7. Infections other than CMV, esp. Epstein Barr virus (EBV) 

8. Lymphoproliferative disorder (also called, post-transplant lymphoma) 

9. Graft Failure or Retransplantation 

10. Need for hemodialysis or other renal replacement therapy for renal transplant patients. 

11. Hepatic encephalopathy for liver transplant recipients. 

12. All cause mortality 

Subsets of the candidate library (or of a previously identified diagnostic nucleotide set), are identified, 
according to the above procedures, that have predictive and/or diagnostic value for kidney or liver 
allograft rejection. 
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Example 7: Identification of a diagnostic nucleotide set for diagnosis of cvtome palnvirin; 
Cytomegalovirus is a very important cause of disease in immunocompromised patients, for example, 
transplant patients, cancer patients, and AIDS patients. The virus can cause inflammation and disease 
in almost any tissue (particularly the colon, lung, bone marrow and retina). It is increasingly important 
to identify patients with current or impending clinical CMV disease, particularly when 
immunosuppressive drugs are to be used in a patient, e.g. for preventing transplant rejection. 
Leukocytes are profiled in patients with active CMV, impending CMV, or no CMV. Expression 
profiles correlating with diagnosis of active or impending CMV are identified. Subsets of the candidate 
library (or a previously identified diagnostic nucleotide set) are identified, according to the above 
procedures that have predictive value for the diagnosis of active or impending CMV. Diagnostic 
nucleotide set(s) identified with predictive value for the diagnosis of active or impending CMV may be 
combined, or used in conjunction with, cardiac, liver and/or kidney allograft-related diagnostic gene 
set(s) (described in Examples 6 and 10). 

In addition, or alternatively, CMV nucleotide sequences are obtained, and a diagnostic nucleotide set is 
designed using CMV nucleotide sequence. The entire sequence of the organism is known and all CMV 
nucleotide sequences can be isolated and added to the library using the sequence information and the 
approach described below. Known expressed genes are preferred. Alternatively, nucleotide sequences 
are selected to represent groups of CMV genes that are coordinately expressed (immediate early genes, 
early genes, and late genes) (Spector et al. 1990, Stamminger et al. 1990). 

Oligonucleotides were designed for CMV genes using the oligo design procedures of Example 8. 
Probes were designed using the 14 gene sequences shown here and were included on the array 
described in example 9: 





HCMVTRL2 (IRL2) 


1893..2240 




HCMVTRL7 (IRL7) 


complement(6595 . .6843) 




HCMVUL21 


complement(26497. .27024) 




HCMVUL27 


complement(3283 1.. 34657) 




HCMVUL33 


43251. .44423 


Cytomegalovirus 


HCMVUL54 


complement(76903 ..8063 1 ) 


(CMV) 


HCMVUL75 


complement 1 0790 1..1 10132) 


Accession #X 17403 


HCMVUL83 


complement 11 9352.. 12 1037) 




HCMVUL106 


complement 1 54947.. 1 55324) 




HCMVUL109 


complement 1 575 1 4.. 1 578 1 0) 




HCMVUL113 


161 503.. 162800 




HCMVUL122 


complement 1 69364.. 1 70599) 




HCMVUL123 (last exon at 3 '-end) 


complement 1 7 1 006.. 1 72225) 




HCMVUS28 


219200..220171 



Diagnostic nucleotide set(s) for expression of CMV genes is used in combination with diagnostic 
leukocyte nucleotide sets for diagnosis of other conditions, e.g. organ allograft rejection. 



126 



WO 2004/042346 



PCT/US2003/012946 



Using the techniques described in example 2 mononuclear samples from 180 cardiac transplant 
recipients (enrolled in the study described in Example 5) were used for expression profiling with the . 
leukocyte arrays. Of these samples 15 were associated with patients who had a diagnosis of primary or 
reactivation CMV made by culture, PCR or any specific diagnostic test. 

After preparation of RNA, amplification, labeling, hybridization, scanning, feature extraction and data 
processing were done as described in Example 1 1 using the oligonucleotide microarrays described in 
Example 9. 

The resulting log ratio of expression of Cy3 (patient sample)/ Cy5 (R50 reference RNA) was used for 
analysis. Significance analysis for microarrays (SAM, Tusher 2001, see Example 15) was applied to 
determine which genes were most significantly differentially expressed between these 15 CMV patients 
and the 165 non-CMV patients (Table 12). 12 genes were identified with a 0% FDR and 6 with a 0.1% 
FDR and are listed in Table 2. Some genes are represented by more than one oligonucleotide on the 
array and for 2 genes, multiple oligonucleotides from the same gene are called significant (SEQ IDs: 
3061, 3064: eomesodermin and 3031, 3040, 104, 2736: small inducible cytokine A4). 
Clinical variables were also included in the significance analysis. For example, the white blood cell 
count and the number of weeks post transplant (for the patient at the time the sample was obtained) 
were available for most of the 180 samples. The log of these variables was taken and the variables 
were then used in the significance analysis described above with the gene expression data. Both the 
white blood cell count (0.1% FDR) and the weeks post transplant (0% FDR) appeared to correlate with 
CMV status. CMV patients were more likely to have samples associated with later post transplant data 
and the lower white blood cell counts. 

These genes and variables can be used alone or in association with other genes or variables or with 
other genes to build a diagnostic gene set or a classification algorithm using the approaches described 
herein. 

Primers for real-time PCR validation were designed for some of these genes as described in Example 
13 and listed in Table 2C and the sequence listing. Using the methods described in example 13, 
primers for Granzyme B were designed and used to validate expression findings from the arrays. 6 
samples were tested (3 from patients with CMV and 3 from patients without CMV). The gene was 
found to be differentially expressed between the patients with and without CMV (see example 13 for 
full description). This same approach can be used to validate other diagnostic genes by real-time PCR. 
Diagnostic nucleotide sets can also be identified for a variety of other viral diseases (Table 1) using this 
same approach. 

cDNA microarrays may be used to monitor viral expression. In addition, these methods may be used to 
monitor other viruses, such as Epstein-Barr virus, Herpes Simplex 1 and vesicular stomatitis vims. 
Example 8- Design of oligonucleotide probes 

By way of example, this section describes the design of four oligonucleotide probes using Array 
Designer Ver 1.1 (Premier Biosoft International, Palo Alto, CA). The major steps in the process are 
given first. 

Obtain best possible sequence of mRNA from GenBank. If a full-length sequence reference sequence 
is not available, a partial sequence is used, with preference for the 3' end over the 5' end. When the 
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sequence is known to represent the antisense strand, the reverse complement of the sequence is used for 

probe design. For sequences represented in the subtracted leukocyte expression library that have no 

significant match in GenBank at the time of probe design, our sequence is used. 

Mask low complexity regions and repetitive elements in the sequence using an algorithm such as 

RepeatMasker. 

Use probe design software, such as Array Designer, version 1.1, to select a sequence of 50 residues 
with specified physical and chemical properties. The 50 residues nearest the 3' end constitute a search 
frame. The residues it contains are tested for suitability. If they don't meet the specified criteria, the 
search frame is moved one residue closer to the 5' end, and the 50 residues it now contains are tested. 
The process is repeated until a suitable 50-mer is found. 

If no such 50-mer occurs in the sequence, the physical and chemical criteria are adjusted until a suitable 
50-mer is found. 

Compare the probe to dbEST, the UniGene cluster set, and the assembled human genome using the 
BLASTn search tool at NCBI to obtain the pertinent identifying information and to verify that the 
probe does not have significant similarity to more than one known gene. 
Clone 40H12 

Clone 40H12 was sequenced and compared to the nr, dbEST, and UniGene databases at NCBI using 
the BLAST search tool. The sequence matched accession number NM_002310, a 'curated RefSeq 
project' sequence, see Pruitt et al. (2000) Trends Genet. 16:44-47, encoding leukemia inhibitory factor 
receptor (LIFR) mRNA with a reported E value of zero. An E value of zero indicates there is, for all 
practical purposes, no chance that the similarity was random based on the length of the sequence and 
the composition and size of the database. This sequence, cataloged by accession number NM_002310, 
is much longer than the sequence of clone 40H12 and has a poly-A tail. This indicated that the 
sequence cataloged by accession number NM_0023 1 0 is the sense strand and a more complete 
representation of the mRNA than the sequence of clone 40H12, especially at the 3' end. Accession 
number "NM002310" was included in a text file of accession numbers representing sense strand 
mRNAs, and sequences for the sense strand mRNAs were obtained by uploading a text file containing 
desired accession numbers as an Entrez search query using the Batch Entrez web interface and saving 
the results locally as a FASTA file. The following sequence was obtained, and the region of alignment 
of clone 40H12 is outlined: 

CTCTCTCCCAGAACGTGTCTCTGCTGCAAGGCACCGGGCCCTTTCGCTCTGCAGAACTGCACTTGCAAGA 
CCATTATCAACTCCTAATCCCAGCTCAGAAAGGGAGCCTCTGCGACTCATTCATCGCCCTCCAGGACTGA 
CTGCATTGCACAGATGATGGATATTTACGTATGTTTGAAACGACCATCCTGGATGGTGGACAATAAAAGA 
ATGAGGACTGCTTCAAATTTCCAGTGGCTGTTATCAACATTTATTCTTCTATATCTAATGAATCAAGTAA 
ATAGCCAGAAAAAGGGGGCTCCTCATGATTTGAAGTGTGTAACTAACAATTTGCAAGTGTGGAACTGTTC 
TTGGAAAGCACCCTCTGGAACAGGCCGTGGTACTGATTATGAAGTTTGCATTGAAAACAGGTCCCGTTCT 
TGTTATCAGTTGGAGAAAACCAGTATTAAAATTCCAGCTCTTTCACATGGTGATTATGAAATAACAATAA 
ATTCTCTACATGATTTTGGAAGTTCTACAAGTAAATTCACACTAAATGAACAAAACGTTTCCTTAATTCC 
AGATACTCCAGAGATCTTGAATTTGTCTGCTGATTTCTCAACCTCTACATTATACCTAAAGTGGAACGAC 
AGGGGTTCAGTTTTTCCACACCGCTCAAATGTTATCTGGGAAATTAAAGTTCTACGTAAAGAGAGTATGG 
AGCTCGTAAAATTAGTGACCCACAACACAACTCTGAATGGCAAAGATACACTTCATCACTGGAGTTGGGC 
CTCAGATATGCCCTTGGAATGTGCCATTCATTTTGTGGAAATTAGATGCTACATTGACAATCTTCATTTT 
TCTGGTCTCGAAGAGTGGAGTGACTGGAGCCCTGTGAAGAACATTTCTTGGATACCTGATTCTCAGACTA 
AGGTTTTTCCTCAAGATAAAGTGATACTTGTAGGCTCAGACATAACATTTTGTTGTGTGAGTCAAGAAAA 
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AGTGTTATCAGCACTGATTGGCCATACAAACTGCCCCTTGATCCATCTTGATGGGGAAAATGTTGCAATC 

AAGATTCGTAATATTTCTGTTTCTGCAAGTAGTGGAACAAATGTAGTTTTTACAACCGAAGATAACATAT 

TTGGAACCGTTATTTTTGCTGGATATCCACCAGATACTCCTCAACAACTGAATTGTGAGACACATGATTT 

AAAAGAAATTATATGTAGTTGGAATCCAGGAAGGGTGACAGCGTTGGTGGGCCCACGTGCTACAAGCTAC 

ACTTTAGTTGAAAGTTTTTCAGGAAAATATGTTAGACTTAAAAGAGCTGAAGCACCTACAAACGAAAGCT 

ATCAATTATTATTTCAAATGCTTCCAAATCAAGAAATATATAATTTTACTTTGAATGCTCACAATCCGCT 

GGGTCGATCACAATCAACAATTTTAGTTAATATAACTGAAAAAGTTTATCCCCATACTCCTACTTCATTC 

AAAGTGAAGGATATTAATTCAACAGCTGTTAAACTTTCTTGGCATTTACCAGGCAACTTTGCAAAGATTA 

ATTTTTTATGTGAAATTGAAATTAAGAAATCTAATTCAGTACAAGAGCAGCGGAATGTCACAATCAAAGG 

AGTAGAAAATTCAAGTTATCTTGTTGCTCTGGACAAGTTAAATCCATACACTCTATATACTTTTCGGATT 

CGTTGTTCTACTGAAACTTTCTGGAAATGGAGCA/^ATGGAGCAATAAAAAACAACATTTAACAACAGAAG 

CCAGTCCTTCAAAGGGGCCTGATACTTGGAGAGAGTGGAGTTCTGATGGAAAAAATTTAATAATCTATTG 

GAAGCCTTTACCCATTAATGAAGCTAATGGAAAAATACTTTCCTACAATGTATCGTGTTCATCAGATGAG 

GAAACACAGTCCCTTTCTGAAATCCCTGATCCTCAGCACAAAGC^GAGATACGACTTGATAAGAATGACT 

ACATCATCAGCGTAGTGGCTAAAAATTCTGTGGGCTCATCACCACCTTCCAAAATAGCGAGTATGGAAAT 

TCCAAATGATGATCTCAAAATAGAACAAGTTGTTGGGATGGGAAAGGGGATTCTCCTCACCTGGCATTAC 

GACCCCAACATGACTTGCGACTACGTCATTAAGTGGTGTAACTCGTCTCGGTCGGAACCATGCCTTATGG 

ACTGGAGAAAAGTTCCCTCAAACAGCACTGAAACTGTAATAGAATCTGATGAGTTTCGACCAGGTATAAG 

ATATAATTTTTTCCTGTATGGATGCAGAAATCAAGGATATCAATTATTACGCTCCATGATTGGATATATA 

G7^AGAATTGGCTCCCATTGTTGCACCAAATTTTACTGTTGAGGATACTTCTGCAGATTCGATATTAGTAA 

AATGGGAAGACATTCCTGTGGAAGAACTTAGAGGCTTTTTAAGAGGATATTTGTTTTACTTTGGAAAAGG 

AGAAAGAGACACATCTAAGATGAGGGTTTTAGAATCAGGTCGTTCTGACATAAAAGTTAAGAATATTACT 

GACATATCCCAGAAGACACTGAGAATTGCTGATCTTCAAGGTAAAACAAGTTACCACCTGGTCTTGCGAG 

CCTATACAGATGGTGGAGTGGGCCCGGAGAAGAGTATGTATGTGGTGACAAAGGAAAATTCTGTGGGATT 

AATTATTGCCATTCTCATCCCAGTGGCAGTGGCTGTCATTGTTGGAGTGGTGACAAGTATCCTTTGCTAT 

CGGAAACGAGAATGGATTAT^AGAAACCTTCTACCCTGATATTCCAAATCCAGAAAACTGTAAAGCATTAC 

AGTTTCAAAAGAGTGTCTGTGAGGGAAGCAGTGCTCTTAAAACATTGGAAATGAATCCTTGTACCCCAAA 

TAATGTTGAGGTTCTGGAAACTCGATCAGCATTTCCTAAAATAGAAGATACAGAAATAATTTCCCCAGTA 

GCTGAGCGTCCTGAAGATCGCTCTGATGCAGAGCCTGAAAACCATGTGGTTGTGTCCTATTGTCCACCCA 

TCATTGAGGAAGAAATACCAAACCCAGCCGCAGATGAAGCTGGAGGGACTGCACAGGTTATTTACATTGA 

TGTTCAGTCGATGTATCAGCCTCAAGCAAAACCAGAAGAAGAACAAGAAAATGACCCTGTAGGAGGGGCA 

GGCTATAAGCCACAGATGCACCTCCCCATTAATTCTACTGTGGAAGATATAGCTGCAGAAGAGGACTTAG 

ATAAAACTGCGGGTTACAGACCTCAGGCCAATGTAAATACATGGAATTTAGTGTCTCCAGACTCTCCTAG 

ATCCATAGACAGCAACAGTGAGATTGTCTCATTTGGAAGTCCATGCTCCATTAATTCCCGACAATTTTTG 

ATTCCTCCTAAAGATGAAGACTCTCCTAAATCTAATGGAGGAGGGTGGTCCTTTACAAACTTTTTTCAGA 

ACAAACCAAACGATTAACAGTGTCACCGTGTCACTTCAGTCAGCCATCTCAATAAGCTCTTACTGCTAGT 

GTTGCTACATCAGCACTGGGCATTCTTGGAGGGATCCTGTGAAGTATTGTTAGGAGGTGAACTTCACTAC 

ATGTTAAGTTACACTGAAAGTTCATGTGCTTTTAATGTAGTCTAAAAGCCAAAGTATAGTGACTCAGAAT 

CCTCAATCCACAAAACTCAAGATTGGGAGCTCTTTGTGATCAAGCCAAAGAATTCTCATGTACTCTACCT 

TCAAGAAGCATTTCAAGGCTAATACCTACTTGTACGTACATGTAAAACAAATCCCGCCGCAACTGTTTTC 

TGTTCTGTTGTTTGTGGTTTTCTCATATGTATACTTGGTGGAATTGTAAGTGGATTTGCAGGCCAGGGAG 

AAAATGTCCAAGTAACAGGTGAAGTTTATTTGCCTGACGTTTACTCCTTTCTAGATGAAAACCAAGCACA 

GATTTTAAAACTTCTAAGATTATTCTCCTCTATCCACAGCATTCACAAAAATTAATATAATTTTTAATGT 

AGTGACAGCGATTTAGTGTTTTGTTTGATAAAGTATGCTTATTTCTGTGCCTACTGT ATAATGGTTATCA 

AACAG TTGTCTCAGGGGTACAAACTTTGAAAACAAGTGTGACACTGACCAGCCCAAAltCATAATCATGTT 

TTCTTGCTGTGATAGGTTTTGCTTGCCTTTTCATTATTTTTTAGCTTTTATGCTTGCTTCCATTATTTCA 

GTTGGTTGCCCTAATATTTAAAATTTACACTTCTAAGACTAGAGACCCACATTTTTTAAAAATCATTTTA 

TTTTGTGATACAGTGACAGCTTTATATGAGCAAATTCAATATTATTCATAAGCATGTAATTCCAGTGACT 

TACTATGTGAGATGACTACTAAGCAATATCTAGCAGCGTTAGTTCCATATAGTTCTGATTGGATTTCGTT 

CCTCCTGAGGAGACCATGCCGTTGAGCTTGGCTACCCAGGCAGTGGTGATCTTTGACACCTTCTGGTGGA 

TGTTCCTCCCACTCATGAGTCTTTTCATCATGCCACATTATCTGATCCAGTCCTCACATTTTTAAATATA 

AAACTAAAGAGAGAATGCTTCTTACAGGAACAGTTACCCAAGGGCTGTTTCTTAGTAACTGTCATAAACT 

GATCTGGATCCATGGGCATACCTGTGTTCGAGGTGCAGCAATTGCTTGGTGAGCTGTGCAGAATTGATTG 

CCTTCAGCACAGCATCCTCTGCCCACCCTTGTTTCTCATAAGCGATGTCTGGAGTGATTGTGGTTCTTGG 

AAAAGCAGAAGGAAAAACTAAAAAGTGTATCTTGTATTTTCCCTGCCj CTCAGGTTGCCTATGTATTTTAC 

CTTTTCATATTTAAGGCAAAAGTACTTGAAAATTTTAAGTGTCCGAATAAGATATGTCTTTTTTGTTTGT 

TTTTTOXGGTT GGTTGTTTGTTTTTTATCATCT 

AATTTTTTTTGAAGCTCATATTGTATCTTTTTAAAAACCATGTTGTGGAAAAAAGCCAGAGTGACAAGTG 
ACAAAATCTATTTAGGAACTCTGTGTATGAATCCTGATTTTAACTGCTAGGATTCAGCTAAATTTCTGAG 
CTTTATGATCTGTGGAAATTTGGAATGAAATCGAATTCATTTTGTACATACATAGTATATTAAAACTATA 
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TAATAGTTCATAGAAATGTTCAGTAATGAAAAAATATATCCAATCAGAGCCATCCCGAAAAAAAAAAAAA 
AA (SEQ ID NO: 3101) 

The FASTA file, including the sequence of NM_002310, was masked using the RepeatMasker web 
interface (Smit, AFA & Green, P RepeatMasker at 

http://ftp.genome.washington.edu/RM/RepeatMasker.html, Smit and Green). Specifically, during 
masking, the following types of sequences were replaced with "N's": SINE/MIR & LINE/L2, LINE/LI 
, LTR/MaLR, LTR/Retroviral , AIu, and other low informational content sequences such as simple 
repeats. Below is the sequence following masking: 

CTCTCTCCCAGAACGTGTCTCTGCTGCAAGGCACCGGGCCCTTTCGCTCTGCAGAACTGCACTTGCAAG 
ACCATTATCAACTCCTAATCCCAGCTCAGAAAGGGAGCCTCTGCGACTCATTCATCGCCCTCCAGGACT 
GACTGCATTGCACAGATGATGGATATTTACGTATGTTTGAAACGACCATCCTGGATGGTGGACAATAAA 
AGAATGAGGACTGCTTCAAATTTCCAGTGGCTGTTATCAACATTTATTCTTCTATATCTAATGAATCAA 
GTAAATAGCCAGAAAAAGGGGGCTCCTCATGATTTGAAGTGTGTAACTAACAATTTGCAAGTGTGGAAC 
TGTTCTTGGAAAGCACCCTCTGGAACAGGCCGTGGTACTGATTATGAAGTTTGCATTGAAAACAGGTCC 
CGTTCTTGTTATCAGTTGGAGAAAACCAGTATTAAAATTCCAGCTCTTTCACATGGTGATTATGAAATA 
ACAATAAATTCTCTACATGATTTTGGAAGTTCTACAAGTAAATTCACACTAAATGAACAAAACGTTTCC 
TTAATTCCAGATACTCCAGAGATCTTGAATTTGTCTGCTGATTTCTCAACCTCTACATTATACCTAAAG 
TGGAACGACAGGGGTTCAGTTTTTCCACACCGCTCAAATGTTATCTGGGAAATTAAAGTTCTACGTAAA 
GAGAGTATGGAGCTCGT7UVAATTAGTGACCCACAACACAACTCTGAATGGCAAAGATACACTTCATCAC 
TGGAGTTGGGCCTCAGATATGCCCTTGGAATGTGCCATTCATTTTGTGGAAATTAGATGCTACATTGAC 
AATCTTCATTTTTCTGGTCTCGAAGAGTGGAGTGACTGGAGCCCTGTGAAGAACATTTCTTGGATACCT 
GATTCTCAGACTAAGGTTTTTCCTCAAGATAAAGTGATACTTGTAGGCTCAGACATAACATTTTGTTGT 
GTGAGTCAAGAAAAAGTGTTATCAGCACTGATTGGCCATACAAACTGCCCCTTGATCCATCTTGATGGG 
GAAAATGTTGCAATCAAGATTCGTAATATTTCTGTTTCTGCAAGTAGTGGAACAAATGTAGTTTTTACA 
ACCGAAGATAACATATTTGGAACCGTTATTTTTGCTGGATATCCACCAGATACTCCTCAACAACTGAAT 
TGTGAGACACATGATTTAAAAGAAATTATATGTAGTTGGAATCCAGGAAGGGTGACAGCGTTGGTGGGC 
CCACGTGCTACAAGCTACACTTTAGTTGAAAGTTTTTCAGGAAAATATGTTAGACTTAAAAGAGCTGAA 
GCACCTACAAACGAAAGCTATCAATTATTATTTCAAATGCTTCCAAATCAAGAAATATATAATTTTACT 
TTGAATGCTCACAATCCGCTGGGTCGATCACAATCAACAATTTTAGTTAATATAACTGAAAAAGTTTAT 
CCCCATACTCCTACTTCATTCT^AAGTGAAGGATATTAATTCAACAGCTGTTAAACTTTCTTGGCATTTA 
CCAGGCAACTTTGCAAAGATTAATTTTTTATGTGAAATTGAAATTAAGAAATCTAATTCAGTACAAGAG 
CAGCGGAATGTCACAATCAAAGGAGTAGAAAATTCAAGTTATCTTGTTGCTCTGGACAAGTTAAATCCA 
TACACTCTATATACTTTTCGGATTCGTTGTTCTACTGAAACTTTCTGGAAATGGAGCAAATGGAGCAAT 
AAAAAACAACATTTAACAACAGAAGCCAGTCCTTCAAAGGGGCCTGATACTTGGAGAGAGTGGAGTTCT 
GATGGAAAAAATTTAATAATCTATTGGAAGCCTTTACCCATTAATGAAGCTAATGGAAAAATACTTTCC 
TACAATGTATCGTGTTCATCAGATGAGGAAACACAGTCCCTTTCTGAAATCCCTGATCCTCAGCACT^AA 
GCAGAGATACGACTTGATAAGAATGACTACATCATCAGCGTAGTGGCTAAAAATTCTGTGGGCTCATCA 
CCACCTTCCAAAATAGCGAGTATGGAAATTCCAAATGATGATCTCAAAATAGAACAAGTTGTTGGGATG 
GGAAAGGGGATTCTCCTCACCTGGCATTACGACCCCAACATGACTTGCGACTACGTCATTAAGTGGTGT 
AACTCGTCTCGGTCGGAACCATGCCTTATGGACTGGAGAAAAGTTCCCTCAAACAGCACTGAAACTGTA 
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ATAGAATCTGATGAGTTTCGACCAGGTATAAGATATAATTTTTTCCTGTATGGATGCAGAAATCAAGGA 

TATCAATTATTACGCTCCATGATTGGATATATAGAAGAATTGGCTCCCATTGTTGCACCAAATTTTACT 

GTTGAGGATACTTCTGCAGATTCGATATTAGTAAAATGGGAAGACATTCCTGTGGAAGAACTTAGAGGC 

TTTTTAAGAGGATATTTGTTTTACTTTGGAAAAGGAGAAAGAGACACATCTAAGATGAGGGTTTTAGAA 

TCAGGTCGTTCTGACATAAAAGTTAAGAATATTACTGACATATCCCAGAAGACACTGAGAATTGCTGAT 

CTTCAAGGTAAAACAAGTTACCACCTGGTCTTGCGAGCCTATACAGATGGTGGAGTGGGCCCGGAGAAG 

AGTATGTATGTGGTGACAAAGGAAAATTCTGTGGGATTAATTATTGCCATTCTCATCCCAGTGGCAGTG 

GCTGTCATTGTTGGAGTGGTGACAAGTATCCTTTGCTATCGGAAACGAGAATGGATTAAAGAAACCTTC 

TAC CCTGATATTC CAAATCCAGAAAACTGTAAAGCATTACAGTTTCAAAAGAGTGTCTGTGAGGG AAGC 

AGTGCTCTTAAAACATTGGAAATGAATCCTTGTACCCCAAATAATGTTGAGGTTCTGGAAACTCGATCA 

GCATTTCCTAAAATAGAAGATACAGAAATAATTTCCCCAGTAGCTGAGCGTCCTGAAGATCGCTCTGAT 

GCAGAGCCTGAAAACCATGTGGTTGTGTCCTATTGTCCACCCATCATTGAGGAAGAAATACCAAACCCA 

GCCGCAGATGAAGCTGGAGGGACTGCACAGGTTATTTACATTGATGTTCAGTCGATGTATCAGCCTCAA 

GCAAAACCAGAAGAAGAACAAGAAAATGACCCTGTAGGAGGGG CAGG CTATAAGCCACAGATGCACCTC 

CCCATTAATTCTACTGTGGAAGATATAGCTGCAGAAGAGGACTTAGATTUU^ACTGCGGGTTACAGACCT 

CAGGCCAATGTAAATACATGGAATTTAGTGTCTCCAGACTCTCCTAGATCCATAGACAGCAACAGTGAG 

ATTGTCTCATTTGGAAGTCCATGCTCCATTAATTCCCGACAATTTTTGATTCCTCCTAAAGATGAAGAC 

TCTCCTAAATCTAATGGAGGAGGGTGGTCCTTTACAAACTTTTTTCAGAACAAACCAAACGATTAACAG 

TGTCACCGTGTCACTTCAGTCAGCCATCTCAATAAGCTCTTACTGCTAGTGTTGCTACATCAGCACTGG 

GCATTCTTGGAGGGATCCTGTGAAGTATTGTTAGGAGGTGAACTTCACTACATGTTAAGTTACACTGAA 

AGTTCATGTGCTTTTAATGTAGTCTAAAAGCCAAAGTATAGTGACTCAGAATCCTCAATCCACAAAACT 

CAAGATTGGGAGCTCTTTGTGATCAAGCCAAAGAATTCTCATGTACTCTACCTTCAAGAAGCATTTCAA 

GGCTAATACCTACTTGTACGTACATGTAAAACT^AATCCCGCCGCAACTGTTTTCTGTTCTGTTGTTTGT 

GGTTTTCTCATATGTATACTTGGTGGAATTGTAAGTGGATTTGCAGGCCAGGGAGAAAATGTCCAAGTA 

ACAGGTGAAGTTTATTTGCCTGACGTTTACTCCTTTCTAGATGAAAACCAAGCACAGATTTTAAAACTT 

CTAAGATTATTCTCCTCTATCCACAGCATTCACNNNNNNNNNNNNNNNNNNNNNNGTAG 

TTAGTGTTTTGTTTGATAAAGTATGCTTATTTCTGTGCCTACTGTATAATGGTTATCAAACAGTTGTCT 

caggggtacaaactttgaaaacaagtgtgacactgaccagcccaaat |cataatcatgttttcttgctg^ 

|GATAGGTTTTGCTTGCCTTTTCATTATTTTTTAGCTTTTATGCTTGCTTCCATTATTTCAGTTGGTTGd 

Icctaatatttaaaatttacacttctaagactagagacccacattttttaaaaatcattttattttgtga) 

rTACAGTGACAGCTTTATATGAGCAAATTCAATATTATTCATAAGCATGTAATTCCAGTGACTTACTATGl 

fTGAGATGACTACTAAGCAATATCTAGCAGCGTTAGTTCCATATAGTTCTGATTGGATTTCGTTCCTCClj 

iGAGGAGACCATGCCGTTGAGCTTGGCTACCCAGGCAGTGGTGATCTTTGACACCTTCTGGTGGATGTTd 

[CTCCCACTCATGAGTCTTTTCATCATGCCACATTATCTGATCCAGTCCTCACATTTTTAAATATAAAACj 

fTAAAGAGAGAATGCTTCTTACAGGAACAGTTACCCj^GGGCTGTTTCTTAGTAACTGTCATAAACTGAT] 

ICTGGATCCATGGGCATACCTGTGTTCGAGGTGCAGCAATTGCTTGGTGAGCTGTGCAGAATTGATTGCCl 

^TTCAGCACAGCATCCTCTGCCCACCCTTGTTTCTCATAAGCGATGTCTGGAGTGATTGTGGTTCTTGGAl 

|AAAGCAGAAGGAAAAACTAAAAAGTGTATCTTGTATTTTCCCTGCCj CTCAGGTTGCCTATGTATTTTAC 

CTTTTCATATTTAAGGCAAAAGTACTTGAAAATTTTAAGTGTCCGAATAAGATATGTCTTTTTTGTTTG 

TTTTTTTTGGTTGGTTGTTTGTTTTTTATCATCTGAGATTCTGTAATGTATTTGCAAATAATGGATCAA 
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TTAATTTTTTTTGAAGCTCATATTGTATCTTTTTAAAAACCATGTTGTGGAAAAAAGCCAGAGTGACAA 
GTGACAAAATCTATTTAGGAACTCTGTGTATGAATCCTGATTTT7^ACTGCTAGGATTCAGCTAAATTTC 
TGAGCTTTATGATCTGTGGAAATTTGGAATGAAATCGAATTCATTTTGTACATACATAGTATATTAAAA 
CTATATAATAGTTCAT AGAAATGTTCAGTAATGAAAAAATATATCCAATCAGAGCCATCCCGAAAA AAA 
AAAAAAAA (SEQ ID NO: 3 102). 

The length of this sequence was determined using batch, automated computational methods and the 
sequence, as sense strand, its length, and the desired location of the probe sequence near the 3* end of 
the mRNA was submitted to Array Designer Ver 1.1 (Premier Biosoft International, Palo Alto, CA). 
Search quality was set at 100%, number of best probes set at 1, length range set at 50 base pairs, Target 
Tm set at 75 C. degrees plus or minus 5 degrees, Hairpin max deltaG at 6.0 -kcal/moL, Self dimmer 
max deltaG at 6.0 -kcal/mol, Run/repeat (dinucleotide) max length set at 5, and Probe site minimum 
overlap set at 1 . When none of the 49 possible probes met the criteria, the probe site would be moved 
50 base pairs closer to the 5* end of the sequence and resubmitted to Array Designer for analysis. 
When no possible probes met the criteria, the variation on melting temperature was raised to plus and 
minus 8 degrees and the number of identical basepairs in a run increased to 6 so that a probe sequence 
was produced. 

In the sequence above, using the criteria noted above, Array Designer Ver 1 . 1 designed a probe 
corresponding to oligonucleotide number 3037 and is indicated by underlining in the sequence above. 
It has a melting temperature of 68.4 degrees Celsius and a max run of 6 nucleotides and represents one 
of the cases where the criteria for probe design in Array Designer Ver 1.1 were relaxed in order to 
obtain an oligonucleotide near the 3' end of the mRNA (Low melting temperature was allowed). 
Clone 463D12 

Clone 463D12 was sequenced and compared to the nr, dbEST, and UniGene databases at NCBI using 
the BLAST search tool. The sequence matched accession number All 84553, an EST sequence with 
the definition line "qd60a05.xl Soares_testis_NHT Homo sapiens cDNA clone IMAGE: 1733840 3 f 
similar to gb:M29550 PROTEIN PHOSPHATASE 2B CATALYTIC SUBUNIT 1 (HUMAN);, mRNA 
sequence." The E value of the alignment was 1.00 x 10* 118 . The GenBank sequence begins with a 
poly-T region, suggesting that it is the antisense strand, read 5* to 3'. The beginning of this sequence is 
complementary to the 3* end of the mRNA sense strand. The accession number for this sequence was 
included in a text file of accession numbers representing antisense sequences. Sequences for antisense 
strand mRNAs were obtained by uploading a text file containing desired accession numbers as an 
Entrez search query using the Batch Entrez web interface and saving the results locally as a FASTA 
file. The following sequence was obtained, and the region of alignment of clone 463D12 is outlined: 

TTTTTTTTTTTTTTCTTAAATAGCATTTATTTTCTCTCAAAAAGCCTATTATGTACTAACAAGTGTTCC 
TCTAAATTAGAAAGGCATCACTACTAAAATTTTATACATATTTTTTATATAAGAGAAGGAATATTGGGT 
TACAATCTGAATTTCTCTTTATGATTTCTCTTAAAGTATAGAACAGCTATTAAAATGACTAATATTGCT 
AAAATGAAGGCTACTAAATTTCCCCAAGAATTTCGGTGGAATGCCCAAAAATGGTGTTAAGATATGCAG 
AAGGGCCCATTTCAAGCAAAGCAATCTCTCCACCCCTTCATAAAAGATTTAAGCTAAAAAAAAAAAAAA 
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aagaa |gaaaatccaacagctgaagacattgggctatttataaatcttctcccagtcccccagacagcc^ 
|cacatgggggctgtaaacagctaactaaaatatctttgagactcttatgtccacacccactgacacaa(j 
IgagagctgtaaccacagtgaaactagactttgctttcctttagcaagtatgtgcctatgatagtaaactI 
jggagtaaatgtaac^ taataaaacaaatttttt^ 

cggtaaagaccacgtgaagacatccataaaattaggcaaccagtaaagatgtggagaaccagtaaactg 
tcgaaattcatcacattattttcatactttaatacagcagctttaattattggagaacatcaaagtaat 
taggtgccgaaaaacattgttattaatgaagggaacccctgacgtttgaccttttctgtaccatctata 

gccctggacttga (seq id no: 3103) 

The FASTA file, including the sequence of AA184553, was then masked using the RepeatMasker web 
interface, as shown below. The region of alignment of clone 463D12 is outlined. 



TTTTTTTTTTTTTTCTTAAATAGCATTTATTTTCTCTCAAAAAGCCTATTATGTACTAACAAGTGTTCC 
TCTAAATTAGAAAGGCATCACTACNNNNNNNNNNNNNNNNN^ 

TACAATCTGAATTTCTCTTTATGATTTCTCTTAAAGTATAGAACAGCTATTAAAATGACTAATATTGCT 
AAAATGjAC^GTACTAAATTTCdCG^ 

AAGGGCCCATTTCAAGCAAAGCAATCTCTCCACCCCTTCATAAAAGATTTAAGCTAAAAAAAAAAAAAA 
AAGAA jGAAAATCCAACAGCTGAAGACATTGGGCTATTTATAAATCTTCTCCCAGTCCCCCAGACAGCCTj 
[CACATGGGGGCTGTAAACAGCTAACTAAAATATCTTTGAGACTCTTATGTCCACACCCACTGACACAAG| 
|GAGAGCTGTAACCACAGTGAAACTAGACTTTGCTTTCCTTTAGCAAGTATGTGCCTATGATAGTAAACT| 
|GGAGTAAATGTAAC^ NNNNNNNN1WNNN^^ 

CGGTAAAGACCACGTGAAGACATCCATAAAATTAGGCAACCAGTAAAGATGTGGAGAACCAGTAAACTG 
TCGAAATTCATCACATTATTTTCATACTTTAATACAGCAGCTTTAATTATTGGAGAACATCAAAGTAAT 
TAGGTGCCGAAAAACATTGTTATTAATGAAGGGAACCCCTGACGTTTGACCTTTTCTGTACCATCTATA 
GCCCTGGACTTGA Masked version of 463D12 sequence. (SEQ ID NO:3104) 
The sequence was submitted to Array Designer as described above, however, the desired location of 
the probe was indicated at base pair 50 and if no probe met the criteria, moved in the 3* direction. The 
complementary sequence from Array Designer was used, because the original sequence was antisense. 
The oligonucleotide designed by Array Designer corresponds to oligonucleotide number 3054 and is 
complementary to the underlined sequence above. The probe has a melting temperature of 72.7 
degrees centigrade and a max run of 4 nucleotides. 
Clone 72D4 

Clone 72D4 was sequenced and compared to the nr, dbEST, and UniGene databases at NCBI using the 
BLAST search tool. No significant matches were found in any of these databases. When compared to 
the human genome draft, significant alignments were found to three consecutive regions of the 
reference sequence NT_008060, as depicted below, suggesting that the insert contains three spliced 
exons of an unidentified gene. 

Residue numbers on Matching residue 

clone 72D4 sequence numbers on NT 008060 

1-198 478646-478843 
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197_489 



479876-480168 



491 -585 



489271-489365 



Because the reference sequence contains introns and may represent either the coding or noncoding 
strand for this gene, BioCardia's own sequence file was used to design the oligonucleotide. Two 
complementary probes were designed to ensure that the sense strand was represented. The sequence of 
the insert in clone 72D4 is shown below, with the three putative exons outlined. 



IGCTGGAGCCCAGAGCACGGAAGCCTGGCCCTCCGAGGAGACAGAAGGGAGTGTCGGACA] 

|CCATGACGAGAGCTT[ TOCAGAATA 

•TCTGGAC^ 

j^CCCAAGTG 

Stgcccggcg 
^gcacgggtcag 

AGAAGATA fc 

[CCAAGTATCACCITCCAATTTCTGGGAGCAGTGCT 
AAAAAC (SEQIDNO:3106) 

The sequence was submitted to RepeatMasker, but no repetitive sequences were found. The sequence 
shown above was used to design the two 50-mer probes using Array Designer as described above. The 
probes are shown in bold typeface in the sequence depicted below. The probe in the sequence is 
oligonucleotide number 3020 (SEQ ID NO: 3020) and the complementary probe is oligonucleotide 
number 318 (SEQ ID NO:318). A portion of the target sequence is listed below (SEQ ID: 3106). 

CAGGTCACACAGCACATCAGTGGCTACATGTGAGCTCAGACCTGGGTCTGCTGCTGTCTGTCTTCCCAA 
TATCCATGACCTTGACTGATGCAGGTGTCTAGGGATACGTCCATCCCCGTCCTGCTGGAGCCCAGAGCA 
CGGAAGCCTGGCCCTCCGAGGAGACAGAAGGGAGTGTCGGACACCATGACGAGAGCTTGGCAGAATAAA 
TAACTTCTTTAAACAATTTTACGGCATGAAGAAATCTGGACCAGTTTATTAAATGGGATTTCTGCCACA 
AACCTTGGAAGAATCACATCATCTTANNCCCAAGTGAAAACTGTGTTGCGTAACAAAGAACATGACTGC 
GCTCCACACATACATCATTGCCCGGCGAGGCGGGACACAAGTCAACGACGGAACACTTGAGACAGGCCT 
ACAACTGTGCACGGGTCAGAAGCAAGTTTAAGCCATACTTGCTGCAGTGAGACTACATTTCTGTCTATA 
GAAGATACCTGACTTGATCTGTTTTTCAGCTCCAGTTCCCAGATGTGC 

GTCAAGGGTCTACACG 

GTGTTGTGGTCCCCAAGT ATCACCTTCCAATTTCTGGGAG - - -> 
CACAACACCAGGGGTTCATAGTGGAAGGTTAAAG- 5 ' 

CAGTGCTCTGGCCGGATCCTTGCCGCGCGGATAAAAACT -> 



ICAGGTCACACAGCACATCAGTGGCTACATGTGAGCTCAGACCTGGGTCTGCTGCTGTCrG^ 



ICITCCCAATATCCATGACCTTGACTGATGCAGGTGT' 




134 



WO 2004/042346 



PCT/US2003/012946 



Confirmation of probe seaucnce 

Following probe design, each probe sequence was confirmed by comparing the sequence against 
dbEST, the UniGene cluster set, and the assembled human genome using BLASTn at NCBI. 
Alignments, accession numbers, gi numbers, UniGene cluster numbers and names were examined and 
the most common sequence used for the probe. 

Example 9 - Production of an array of 8000 spotted SOmer oligonucleotides 

We produced an array of 8000 spotted initial candidate 50mer oligonucleotides. Example 8 

exemplifies the design and selection of probes for this array. 

Sigma-Genosys (The Woodlands, TX) synthesized un-modified 50-mer oligonucleotides using 
standard phosphoramidite chemistry, with a starting scale of synthesis of 0.05 umole (see, e.g., R. 
Meyers, ed. (1995) Molecular Biology and Biotechnology: A Comprehensive Desk Reference ). 
Briefly, to begin synthesis, a 3' hydroxyl nucleoside with a dimethoxytrityl (DMT) group at the 5' end 
was attached to a solid support. The DMT group was removed with trichloroacetic acid (TCA) in order 
to free the 5*-hydroxyl for the coupling reaction. Next, tetrazole and a phosphoramidite derivative of 
the next nucleotide were added. The tetrazole protonates the nitrogen of the phosphoramidite, making 
it susceptible to nucleophilic attack. The DMT group at the 5'-end of the hydroxyl group blocks 
further addition of nucleotides in excess. Next, the inter-nucleotide linkage was converted to a 
phosphotriester bond in an oxidation step using an oxidizing agent and water as the oxygen donor. 
Excess nucleotides were filtered out and the cycle for the next nucleotide was started by the removal of 
the DMT protecting group. Following the synthesis, the oligo was cleaved from the solid support. The 
oligonucleotides were desalted, resuspended in water at a concentration of 100 or 200 uM, and placed 
in 96-deep well format. The oligonucleotides were re-arrayed into Whatman Uniplate 384-well 
polyproylene V bottom plates. The oligonucleotides were diluted to a final concentration 30 uM in IX 
Micro Spotting Solution Plus (Telechem/arrayit.com, Sunnyvale, CA) in a total volume of 15 ul. In 
total, 8,031 oligonucleotides were arrayed into twenty-one 384-well plates. 

Arrays were produced on Telechem/arrayit.com Super amine glass substrates (Telechem/arrayit.com), 
which were manufactured in 0.1 mm filtered clean room with exact dimensions of 25x76x0.96 mm. 
The arrays were printed using the Virtek Chipwriter with a Telechem 48 pin Micro Spotting Printhead. 
The Printhead was loaded with 48 Stealth SMP3B TeleChem Micro Spotting Pins, which were used to 
print oligonucleotides onto the slide with the spot size being 110-115 microns in diameter. 

Example 10: Identific ation of dia gnostic nucleotide sets for diagnosis of Cardia c Allograft Rejection 

Genes were identified which have expression patterns useful for the diagnosis and monitoring of 
cardiac allograft rejection. Further, sets of genes that work together in a diagnostic algorithm for 
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allograft rejection were identified. Patients, patient clinical data and patient samples used in the 
discovery of markers below were derived from a clinical study described in example 5. 
The collected clinical data is used to define patient or sample groups for correlation of expression data. 
Patient groups are identified for comparison, for example, a patient group that possesses a useful or 
interesting clinical distinction, verses a patient group that does not possess the distinction. Measures of 
cardiac allograft rejection are derived from the clinical data described above to divide patients (and 
patient samples) into groups with higher and lower rejection activity over some period of time or at any 
one point in time. Such data are rejection grade as determined from pathologist reading of the cardiac 
biopsies and data measuring progression of end-organ damage, including depressed left ventricular 
dysfunction (decreased cardiac output, decreased ejection fraction, clinical signs of low cardiac output) 
and usage of inotropic agents (Kobashigawa 1998). 

Expression profiles correlating with occurrence of allograft rejection are identified, including 
expression profiles corresponding to end-organ damage and progression of end-organ damage. 
Expression profiles are identified predicting allograft rejection, and response to treatment or likelihood 
of response to treatment. Subsets of the candidate library (or a previously identified diagnostic 
nucleotide set) are identified, that have predictive value for the presence of allograft rejection or 
prediction of allograft rejection or end organ damage. 

Mononuclear RNA samples were collected from patients who had recently undergone a cardiac 
allograft transplantation using the protocol described in example 2. The allograft rejection status at the 
time of sample collection was determined by examination of cardiac biopsies as described in example 
5. 

180 samples were included in the analysis. Each patient sample was associated with a biopsy and 
clinical data collected at the time of the sample. The cardiac biopsies were graded by a pathologist at 
the local center and by a centralized pathologist who read the biopsy slides from all four local centers 
in a blinded manner. Biopsy grades included 0, 1A, IB, 2, 3A, and 3B. No grade 4 rejection was 
identified. Dependent variables were developed based on these grades using either the local center 
pathology reading or the higher of the two readings, local or centralized. The dependent variables used 
for correlation of gene expression profiles with cardiac allograft rejection are shown in Table 4. 
Dependent variables are used to create classes of samples corresponding to the presence or absence of 
rejection. 

Clinical data were also used to determine criteria for including samples in the analysis. The strictest 
inclusion criteria required that samples be from patients who did not have a bacterial or viral infection, 
were at least two weeks post cardiac transplant and were not currently admitted to the hospital. A 
second inclusion criteria (inclusion 2) reduced the post-transplant criteria to 1 week and eliminated the 
hospital admission criteria. 

After preparation of RNA (example 2), amplification, labeling, hybridization, scanning, feature 
extraction and data processing were done as described in Example 11, using the oligonucleotide 
microarrays described in Example 9. The resulting log ratio of expression of Cy3 (patient sample)/ 
Cy5 (R50 reference RNA) was used for analysis. This dataset is called the "static" data. A second 
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type of dataset, referenced, was derived from the first. These datasets compared the gene expression 
log ratio in each sample to a baseline sample from the same patient using the formula: 



Two referenced datasets were used, named "0 HG" and "Best 0". The baseline for 0 HG was a Grade 0 
sample from the same patient as the sample, using the highest grade between the centralized and local 
pathologists. The baseline for Best 0 was a Grade 0 sample from the same patient as the sample, using 
both the local and centralized reader biopsy grade- data. When possible a Grade 0 prior to the sample 
was used as the baseline in both referenced datasets. 

The datasets were also divided into subsets to compare analysis between two subsets of roughly half of 
the data. The types of subsets constructed were as follows. First half/second half subsets were the first 
half of the samples and the second half of the samples from a dataset ordered by sample number. 
Odd/even subsets used the same source, a dataset ordered by sample number, but the odd subset 
consisted of every 2 nd sample starting with the first and the even subset consisted of every 2 nd sample 
starting with the second sample, Center 14/other subsets were the same datasets, divided by transplant 
hospital. The center 14 subset consisted of all samples from patients at center 14, while the other 
subset consisted of all samples from the other three centers (12,13, and 15). 

Initially, significance analysis for microarrays (SAM, Tusher 2001, Example 15) was used to discover 
genes that were differentially expressed between the rejection and no-rejection groups. Ninety-six 
different combinations of dependent variables, inclusion criteria, static/referenced, and data subsets 
were used in SAM analysis to develop the primary lists of genes significantly differentially expressed 
between rejection and no-rejection. The most significant of these genes were chosen based on the 
following criteria. Tier 1 genes were those which appeared with an FDR of less than 20% in identical 
analyses in two independent subsets. Tier 2 genes were those which appeared in the top 20 genes on 
the list with an FDR less than 20% more than 50% of the time over all dependent variables with the 
inclusion criteria, and static/referenced constant. Tier 3 genes were those that appeared more than 50% 
of the time with an FDR less than 20% more than 50% of the time over all dependent variables with the 
inclusion criteria, and static/referenced constant. The genes that were identified by the analysis as 
statistically differentially expressed between rejection and no rejection are shown in Table 2. 
SAM chooses genes as significantly different based on the magnitude of the difference between the 
groups and the variation among the samples within each group. An example of the difference between 
some Grade 0 and some Grade 3A samples for 9 genes is shown in Figure 7A. 

Additionally, many of these same combinations were used in the Supervised Harvesting of Expression 
Trees (SHET, Hastie et al. 2001) algorithm (see example 15) to identify markers that the algorithm 
chose as the best to distinguish between the rejection and no rejection classes using a bias factor of 
0.01 . The top 20 or 30 terms were taken from the SHET output and among all comparisons in either 
the static or referenced data the results were grouped. Any gene found in the top 5 terms in more than 
50% of the analyses was selected to be in group Bl (Table 2). The occurrences of each gene were 
tabulated over all SHET analysis (for either static or referenced data) and the 10 genes that occurred the 
most were selected to be in group B2 (Table 2). 
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An additional classification method used was CART (Salford Systems, San Diego, example 15). Either 
the static or referenced dataset was reduced to only the genes for which expression values (log ratios) 
were present in at least 80% of the samples. These data were used in CART with the default settings, 
using the Symmetric Gini algorithm. Each of the dependent variables was used with both the full 
sample set and the strict inclusion criteria. Two groups of genes were identified. Group CI were those 
genes that were a primary splitter (1 st decision node). Group C2 genes were the 10 genes that occurred 
as splitters the most often over all these analyses. 

Two other classification models were developed and their best genes identified as markers of cardiac 
allograft rejection. Group D genes were identified from a set of 59 samples, referenced data, local 
biopsy reading grade, using logistic regression. Group E genes were identified from the primary static 
dataset using a K-nearest neighbor classification algorithm. 

Both hierarchical clustering (Eisen et al. 1998) and CART were used to identify surrogates for each 
identified marker. Hierarchical clustering surrogates are genes co-expressed in these and were chosen 
from the nearest branches of the dendrogram. CART surrogates were identified by CART as the 
surrogates for those genes chosen as primary splitters at decision nodes. 

Primers for real-time PCR validation were designed for each of the marker genes as described in 
Example 13. 

CART was used to build a decision tree for classification of samples as rejection or no-rejection using 
the gene expression data from the arrays. The analysis identified sets of genes that can be used 
together to accurately identify samples derived from cardiac allograft transplant patients. The set of 
genes and the identified threshold expression levels for the decision tree are referred to as a "models". 
This model can be used to predict the rejection state of an unknown sample. The input data were the 
static expression data (log ratio) and the referenced expression data (log ratio referenced to the best 
available grade 0 from either the centralized reader or the local reader) for 139 of our top marker genes. 
These two types of expression data were entered into the CART software as independent variables. 
The dependent variable was rejection state, defined for this model as no rejection = grade O and 
rejection = grade 3 A. Samples were eliminated from consideration in the training set if they were from 
patients with either bacterial or viral infection or were from patients who were less than two weeks 
post-transplant. The method used was Symmetric Gini, allowing linear combinations of independent 
variables. The costs were set to 1 for both false negatives and false positives and the priors were set 
equal for the two states. No penalties were assessed for missing data, however the marker genes 
selected have strong representation across the dataset. 10-fold cross validation was used to test the 
model. Settings not specified remained at the default values. 

The model shown in Figure 7B is based on decisions about expression values at three nodes, each a 
different marker gene. The cost assigned to this model is 0.292, based on the priors being equal, the 
costs set to 1 for each type of error, and the results from the 10-fold cross validation. 
In the training set, no rejection samples were misclassified (sensitivity = 100%) and only 1 no-rejection 
sample was misclassified (specificity = 94.4%). Following 10-fold cross validation, 2 rejection 
samples were misclassified (sensitivity = 87.5%) and 3 no-rejection samples were misclassified 
(specificity = 83.3%). The CART software assigns surrogate markers for each decision node. 
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These genes can be used alone or in association with other genes or variables to build a diagnostic gene 
set or a classification algorithm. These genes can be used in association with known gene markers for 
rejection (such as those identified in the prior art) to provide a diagnostic algorithm. 

Example If- Amplification, labeling, and hybridization of total RNA to an oligonucleotide microarray 
Amplification, labeling, hybridization and scanning 

Samples consisting of at least 0.5 to 2 ug of intact total RNA were further processed for array 
hybridization. When available, 2 ug of intact total RNA is used for amplification. Amplification and 
labeling of total RNA samples was performed in three successive enzymatic reactions. First, a single- 
stranded DNA copy of the RNA was made (hereinafter, "ss-cDNA"). Second, the ss-cDNA was used 
as a template for the complementary DNA strand, producing double-stranded cDNA (hereinafter, "ds- 
cDNA, or cDNA"). Third, linear amplification was performed by in vitro transcription from a bacterial 
T 7 promoter. During this step, fluorescent-conjugated nucleotides were incorporated into the amplified 
RNA (hereinafter, "aRNA"). 

The first strand cDNA was produced using the Invitrogen kit (Superscript II). The first strand cDNA 
was produced in a reaction composed of 50 mM Tris-HCl (pH 8.3), 75 mM KC1, and 3 mM MgCl 2 (lx 
First Strand Buffer, Invitrogen), 0.5 mM dGTP, 0.5 mM dATP, 0.5 mM dTTP, 0.5 mM dCTP, 10 mM 
DTT, 200 U reverse transcriptase (Superscript II, Invitrogen, #18064014), 15 U RNase inhibitor 
(RNAGuard, Amersham Pharmacia, #27-0815-01), 5 \M T7T24 primer 
(5'-GGCCAGTGAATTGTAATACGACTCACT^ 

TTT-3')> (SEQ ID NO:3105) and 0.5 to 2 ug of selected sample total RNA. Several purified, 
recombinant control mRNAs from the plant Arabidopsis thaliana were added to the reaction mixture: 2- 
20 pg of the following genes CAB, RCA, LTP4, NAC1, RCP1,XCP2, RBCL, LTP6, TIM, and PRKase 
(Stratagene, #252201, #252202, #252204, #252208, #252207, #252206, #252203, #252205, #252209, 
#252210 respectively). The control RNAs allow the estimate of copy numbers for individual mRNAs 
in the clinical sample because corresponding sense oligonucleotide probes for each of these plant genes 
are present on the microarray. The final reaction volume of 20 ul was incubated at 42°C for 90 min. 
For synthesis of the second cDNA strand, DNA polymerase and RNase were added to the previous 
reaction, bringing the final volume to 150 The previous contents were diluted and new substrates 
were added to a final concentration of 20 mM Tris-HCl (pH 7.0) (Fisher Scientific, Pittsburgh, PA 
#BP1756-100), 90 mMKCl (Teknova, Half Moon Bay, CA, #0313-500) , 4.6 mM MgCl 2 (Teknova, 
Half Moon Bay, CA, #0304-500), 10 mM(NH 4 ) 2 S04 (Fisher Scientific #A702-500)(lx Second Strand 
buffer, Invitrogen), 0.266 mM dGTP, 0.266 mM dATP, 0.266 mM dTTP, 0.266 mM dCTP, 40 U E. 
coli DNA polymerase (Invitrogen, #18010-025), and 2 U RNaseH (Invitrogen, #18021-014). The 
second strand synthesis took place at 16°C for 150 minutes. 

Following second-strand synthesis, the ds-cDNA was purified from the enzymes, dNTPs, and buffers 
before proceeding to amplification, using phenol-chloroform extraction followed by ethanol 
precipitation of the cDNA in the presence of glycogen. 

Alternatively, a silica-gel column is used to purify the cDNA (e.g. Qiaquick PCR cleanup from Qiagen, 
#28104). The volume of the column purified cDNA was reduced by ethanol precipitation in the 
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presence of glycogen in which the cDNA was collected by centrifugation at > 10,000 xg for 30 minutes, 
the supernatant is aspirated, and 150 ul of 70% ethanol, 30% water was added to wash the DNA pellet. 
Following centrifugation, the supernatant was removed, and residual ethanol was evaporated at room 
temperature. Alternatively, the volume of the column purified cDNA is reduce in a vacuum 
evaporator where the supernatant is reduce to a final volume of 7.4 ul. 

Linear amplification of the cDNA was performed by in vitro transcription of the cDNA. The cDNA 
pellet from the step described above was resuspended in 7.4 ul of water, and in vitro transcription 
reaction buffer was added to a final volume of 20 ul containing 7.5 mM GTP, 7.5 mM ATP, 7.5 mM 
TTP, 2.25 mM CTP, 1.025 mM Cy3 -conjugated CTP (Perkin Elmer; Boston, MA, #NEL-580), Ix 
reaction buffer (Ambion, Megascript Kit, Austin, TX and #1334) and 1 % T 7 polymerase enzyme mix 
(Ambion, Megascript Kit, Austin, TX and #1334). This reaction was incubated at 37°C overnight. 
Following in vitro transcription, the RNA was purified from the enzyme, buffers, and excess NTPs 
using the RNeasy kit from Qiagen (Valencia, CA; # 74106) as described in the vendor's protocol. A 
second elution step was performed and the two eluates were combined for a final volume of 60 ul. 
RNA is quantified using an Agilent 2100 bioanalyzer with the RNA 6000 nano LabChip. 
Reference RNA was prepared as described above, except Cy5-CTP was incorporated instead of 
Cy3CTP. Reference RNA from five reactions, each reaction started with 2 ug total RNA, was pooled 
together and quantitated as described above. 

Hybridization to an array 

RNA was prepared for hybridization as follows: for an 18mmx55mm array, 20 ug of amplified RNA 
(aRNA) was combined with 20 ug of reference aRNA. The combined sample and reference aRNA was 
concentrated by evaporating the water to 10 ul in a vacuum evaporator. The sample was fragmented 
by heating the sample at 95°C for 30 minutes to fragment the RNA into 50-200 bp pieces. 
Alternatively, the combined sample and reference aRNA was concentrated by evaporating the water to 
5 ul in a vacuum evaporator. Five ul of 20 mM zinc acetate was added to the aRNA and the mix 
incubated at 60°C for 10 minutes. Following fragmentation, 40 ul of hybridization buffer was added to 
achieve final concentrations of 5xSSC and 0.20 %SDS with 0.1 ug/ul of Cot-1 DNA (Invitrogen) as a 
competitor DNA. The final hybridization mix was heated to 98°C, and then reduced to 50°C at 0. 1°C 
per second. 

Alternatively, formamide is included in the hybridization rnixture to lower the hybridization 
temperature. 

The hybridization mixture was applied to a pre-heated 65°C microarray, surface, covered with 
a glass coverslip (Coming, #2935-246), and placed on a pre-heated 65°C hybridization chamber 
(Telechem, AHC-10). 15 ul of 5xSSC was placed in each of the reservoir in the hybridization chamber 
and the chamber was sealed and placed in a water bath at 62°C for overnight (16-20 hrs). Following 
incubation, the slides were washed in 2xSSC, 0.1% SDS for five minutes at 30°C, then in 2xSSC for 
five minutes at 30°C, then in 2xSSC for another five minutes at 30°C, then in 0.2xSSC for two minutes 
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at room temperature. The arrays were spun at lOOOxg for 2 minutes to dry them. The dry microarrays 
are then scanned by methods described above. 

The microarrays were imaged on the Agilent (Palo Alto, CA) scanner G2565AA. The scan settings 
using the Agilent software were as follows: for the PMT Sensitivity (100% Red and 100% Green); 
Scan Resolution (10 microns); red and green dye channels; used the default scan region for all slides in 
the carousel; using the largest scan region; scan date for Instrument ID; and barcode for Slide ID. The 
full image produced by the Agilent scanner was flipped, rotated, and split into two images (one for 
each signal channel) using TIFFSplitter (Agilent, Palo Alto, CA). The two channels are the output at 
532 run (Cy3-labeled sample) and 633 nm (Cy5-labeled R50). The individual images were loaded into 
GenePix 3.0 (Axon Instruments, Union City, CA) for feature extraction, each image was assigned an 
excitation wavelength corresponding the file opened; Red equals 633 nm and Green equals 532 nm 
The setting file (gal) was opened and the grid was laid onto the image so that each spot in the grid 
overlaped with >50% of the feature. Then the GenePix software was used to find the features without 
setting minimum threshold value for a feature. For features with low signal intensity, GenePix reports 
"not found". For all features, the diameter setting was adjusted to include only the feature if necessary. 

The GenePix software determined the median pixel intensity for each feature (F s ) and the median pixel 
intensity of the local background for each feature (Bj) in both channels. The standard deviation (SDFj 
and SDBj) for each is also determined. Features for which GenePix could not discriminate the feature 
from the background were "flagged" as described below. 

Following feature extraction into a " .gpr" file, the header information of the .gpr file was 
changed to carry accurate information into the database. An Excel macro was written to include the 
following information: Name of the original .tif image file, SlidelD, Version of the feature extraction 
software, GenePix Array List file, GenePix Settings file, ScanID, Name of person who scanned the 
slide, Green PMT setting, Red PMT setting, ExtractID (date .gpr file was created, formatted as 
yyyy.mm.dd-hh.mm.ss), Results file name (same as the .gpr file name), StorageCD, and Extraction 
comments. 

Pre-processing with Excel Templates 

Following analysis of the image and extraction of the data, the data from each hybridization was pre- 
processed to extract data that was entered into the database and subsequently used for analysis. The 
complete GPR file produced by the feature extraction in GenePix was imported into an excel file pre- 
processing template or processed using a AWK script. Both programs used the same processing logic 
and produce identical results. The same excel template or AWK script was used to process each GPR 
file. The template performs a series of calculations on the data to differentiate poor features from 
others and to combine duplicate or triplicate feature data into a single data point for each probe. 
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The data columns used in the pre-processing were: Oligo ID, F633 Median (median value from all the 
pixels in the feature for the Cy5 dye), B633 Median (the median value of all the pixels in the local 
background of the selected feature for Cy5), B633 SD (the standard deviation of the values for the 
pixels in the local background of the selected feature for Cy5), F532 Median (median value from all the 
pixels in the feature for the Cy3 dye), B532 Median (the median value of all the pixels in the local 
background of the selected feature for Cy3), B532 SD (the standard deviation of the values for the 
pixels in the local background of the selected feature, for Cy3), and Flags. The GenePix Flags column 
contains the flags set during feature extraction. "-75" indicates there were no features printed on the 
array in that position, "-50" indicates that GenePix could not differentiate the feature signal from the 
local background, and "-100" indicates that the user marked the feature as bad. 
Once imported, the data associated with features with -75 flags was not used. Then the median of 
B633 SD and B532 SD were calculated over all features with a flag value of "0". The minimum values 
of B633 Median and B532 Median were identified, considering only those values associated with a flag 
value of "0". For each feature, the signal to noise ratio (S/N) was calculated for both dyes by taking the 
fluorescence signal minus the local background (BGSS) and dividing it by the standard deviation of the 
local background: 

1 SDB i 

If the S/N was less than 3, then an adjusted background-subtracted signal was calculated as the 
fluorescence minus the minimum local background on the slide. An adjusted S/N was then calculated 
as the adjusted background subtracted signal divided by the median noise over all features for that 
channel. If the adjusted S/N was greater than three and the original S/N were less than three, a flag of 
25 was set for the Cy5 channel, a flag of 23 was set for the Cy3 channel, and if both met these criteria, 
then a flag of 28 was set. If both the adjusted S/N and the original S/N were less than three, then a flag 
of 65 was set for Cy5, 63 set for Cy3, and 68 set if both dye channels had an adjusted S/N less than 
three. All signal to noise calculations, adjusted background-subtracted signal, and adjusted S/N were 
calculated for each dye channel. If the BGSS value was greater than or equal to 64000, a flag was set 
to indicate saturation; 55 for Cy5, 53 for Cy3, 58 for both. 

The BGSS used for further calculations was the original BGSS if the original S/N was greater than or 
equal to three. If the original S/N ratio was less than three and the adjusted S/N ratio was greater than 
or equal to three, then the adjusted BGSS was used. If the adjusted S/N ratio was less than three, then 
the adjusted BGSS was used, but with knowledge of the flag status. 

To facilitate comparison among arrays, the Cy3 and Cy5 data were scaled. The log of the ratio of 
Green/Red was detennined for all features. The median log ratio value for good features (Flags 0, 23, 
25, 28, 63) was determined. The feature values were scaled using the following formula: 
Log_Scaled_Feature_Ratio = Log_Feature_Ratio - Median_Log_Ratio. 

The flag setting for each feature was used to determine the expression ratio for each probe, a choice of 
one, two or three features. If all features had flag settings in the same category (categories=negatives, 
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0 to 28, 53-58, and 63-68), then the average of the three scaled, anti log feature ratios was calculated If 
the three features did not have flags in the same category, then the feature or features with the best 
quality flags were used (0>25>23>28>55>53>58>65>63>68). Features with negative flags were never 
used. When the best flags were two or three features in the same category, the anti log average was 
used. If a single feature had a better flag category than the other two then the anti log of that feature 
ratio was used. 

Once the probe expression ratios were calculated from the one, two, or three features, the log of the 
scaled, averaged ratios was taken as described below and stored for use in analyzing the data. 
Whichever features were used to calculate the probe value, the flag from those features was carried 
forward and stored as the flag value for that probe. 2 different data sets can be used for analysis. 
Flagged data uses all values, including those with flags. Filtered data sets are created by removing 
flagged data from the set before analysis. 

Example 12: Real-time PCR validation of array express ion results 

Leukocyte microarray gene expression was used to discover expression markers and diagnostic gene 
sets for clinical outcomes. It is desirable to validate the gene expression results for each gene using a 
more sensitive and quantitative technology such as real-time PCR. Further, it is possible for the 
diagnostic nucleotide sets to be implemented as a diagnostic test as a real-time PCR panel. 
Alternatively, the quantitative information provided by real-time PCR validation can be used to design 
a diagnostic test using any alternative quantitative or semi-quantitative gene expression technology. 
To validate the results of the microarray experiments we used real-time, or kinetic, PCR. In this type 
of experiment the amplification product is measured during the PCR reaction. This enables the 
researcher to observe the amplification before any reagent becomes rate limiting for amplification. In 
kinetic PCR the measurement is of C T (threshold cycle) or C P (crossing point). This measurement 
(Cj=Cp) is the point at which an amplification curve crosses a threshold fluorescence value. The 
threshold is set to a point within the area where all of the reactions were in their linear phase of 
amplification. When measuring C T , a lower C T value is indicative of a higher amount of starting 
material since an earlier cycle number means the threshold was crossed more quickly. 
Several fluorescence methodologies are available to measure amplification product in real-time PCR. 
Taqman (Applied BioSystems, Foster City, CA) uses fluorescence resonance energy transfer (FRET) to 
inhibit signal from a probe until the probe is degraded by the sequence specific binding and Taq 3' 
exonuclease activity. Molecular Beacons (Stratagene, La Jolla, CA) also use FRET technology, 
whereby the fluorescence is measured when a hairpin structure is relaxed by the specific probe binding 
to the amplified DNA. The third commonly used chemistry is Sybr Green, a DNA-binding dye 
(Molecular Probes, Eugene, OR). The more amplified product that is produced, the higher the signal. 
The Sybr Green method is sensitive to non-specific amplification products, increasing the importance 
of primer design and selection. Other detection chemistries can also been used, such as ethedium 
bromide or other DNA-binding dyes and many modifications of the fluorescent dye/quencher dye 
Taqman chemistry. 
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Sample prep and cDNA synthesis 

The inputs for real time PCR reaction are gene-specific primers, cDNA from specific patient samples, 
and standard reagents. The cDNA was produced from mononuclear RNA (prepared as in example 2) 
or whole blood RNA by reverse transcription using Oligo dT primers (Invitrogen, 1841 8-012) and 
random hexamers (Invitrogen, 48190-01 1) at a final concentration of 0.5ng/ul and 3ng/ul respectively. 
For the first strand reaction mix, 0.5 ug of mononuclear total RNA or 2 jig of whole blood RNA and 1 
ul of the Oligo dT/ Random Hexamer Mix, were added to water to a final volume of 1 1 .5 pi. The 
sample mix was then placed at 70*C for 10 minutes. Following the 70°C incubation, the samples were 
chilled on ice, spun down, and 88.5 u.1 of first strand buffer mix dispensed into the reaction tube. The 
final first strand buffer mix produced final concentrations of IX first strand buffer (Invitrogen, Y00146, 
Carlsbad, CA), 10 mM DTT (Invitrogen, Y00147), 0.5 mM dATP (NEB, N0440S, Beverly, MA), 0.5 
mM dGTP (NEB, N0442S), 0.5mM dTTP (NEB, N0443S), 0.5 mM dCTP (NEB, N0441S), 200U of 
reverse transcriptase (Superscript II, Invitrogen, 18064-014), and 18U of RNase inhibitor (RNAGaurd 
Amersham Pharmacia, 27-0815-01, Piscataway, NJ). The reaction was incubated at 42°C for 90 
minutes. After incubation the enzyme was heat inactivated at 70°C for 15 minutes, 2 U of RNAse H 
added to the reaction tube, and incubated at 37°C for 20 minutes. 
PRIMER DESIGN 

Two methods were used to design primers. The first was to use the software, Primer Express 0 " and 
recommendations for primer design that are provided with the GeneAmp® 7700 Sequence Detection 
System supplied by Applied BioSystems (Foster City, CA). The second method used to design primers 
was the PRIMER3 ver 0.9 program that is available from the Whitehead Research Institute, 
Cambridge, Massachusetts at the Whitehead Research web site. The program can also be accessed on 
the World Wide Web at the web site at the Massechusetts Institute of Technology website. Primers 
and Taqman/hybridization probes were designed as described below using both programs. 
The Primer Express literature explains that primers should be designed with a melting temperature 
between 58 and 60 degrees C while the Taqman probes should have a melting temperature of 68 to 70 
under the salt conditions of the supplied reagents. The salt concentration is fixed in the software. 
Primers should be between 15 and 30 basepairs long. The primers should produce and amplicon in size 
between 50 and 150 base pairs, have a C-G content between 20% and 80%, have no more than 4 
identical base pairs next to one another, and no more than 2 C's and G's in the last 5 bases of the 3' 
end. The probe cannot have a G on the 5' end and the strand with the fewest G's should be used for the 
probe. 

Primer3 has a large number of parameters. The defaults were used for all except for melting 
temperature and the optimal size of the amplicon was set at 100 bases. One of the most critical is salt 
concentration as it affects the melting temperature of the probes and primers. In order to produce . 
primers and probes with melting temperatures equivalent to Primer Express, a number of primers and 
probes designed by Primer Express were examined using PRIMER3. Using a salt concentration of 50 
mM these primers had an average melting temperature of 3.7 degrees higher than predicted by Primer 
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Express. In order to design primers and probes with equivalent melting temperatures as Primer Express 

using PRIMER3, a melting temperature of 62.7 plus/minus 1.0 degree was used in PRIMER3 for 

primers and 72.7 plus/minus 1.0 degrees for probes with a salt concentration of 50 mM. 

The C source code for Primer3 was downloaded and complied on a Sun Enterprise 250 server using the 

GCC compiler. The program was then used from the command line using a input file that contained 

the sequence for which we wanted to design primers and probes along with the input parameters as 

described by help files that accompany the software. Using scripting it was possible to input a number 

of sequences and automatically generate a number of possible probes and primers. 

Primers for p- Actin (Beta Actin, Genbank Locus: NMJ)01 101)and p-GUS: glucuronidase, beta, 

(GUSB, Genbank Locus: NM_000181), two reference genes, were designed using both methods and 

are shown here as examples: 

The first step was to mask out repetitive sequences found in the mRNA sequences using RepeatMasker 
program that can be accessed at: the web site University of Washington Genome Repeatmasker 
website. (Smit, A.F.A. & Green, P.). 

The last 500 basepairs on the last 3' end of masked sequence was then submitted to PRIMER3 using 
the following exemplary input sequences: 

PRIMER_SEQUENCE_ID=>ACTB Beta Actin (SEQID 3083) 

S EQUENCE = TTGGCTTGACTCAGGATTTAAAAACTGGAACGGTGAAGGTGACAGCAGTCGGTTGG ACGA 
GCATCCCCCAAAGTTCACAATGTGGCCGAGGACTTTGATTGCACATTGTTGTTTTTTAATAGTCATTCC 
AAATATGAGATGCATTGTTACAGGAAGTCCCTTGCCATCCTAA7VAGCACCCCACTTCTCTCTAAGGAGA 
ATGGCCCAGTCCTCTCCCAAGTCCACACAGGGGAGGGATAGCATTGCTTTCGTGTAAATTATGTAATGC 
AAAATTTTTTTAATCTTCGCCTTAATCTTTTTTATTTTGTTTTATTTTGAATGATGAGCCTTCGTGCCC 
CCCCTTCCCCCTTTTTTCCCCCAACTTGAGATGTATGAAGGCTTTTGGTCTCCCTGGGAGTGGGTGGAG 
GCAGCCGGGCTTACCTGTACACTGACTTGAGACCAGTTGAATAAAAGTGCACACCTTA 

PRIMER_SEQUENCE_ID=>GUSB (SEQID 3084) 

S EQUENCE =GAAGAGTACCAGAAAAGTCTGCTAGAGCAGTACCATCTGGGTCTGGATCAAAAACGCAGA 
AAATATGTGGTTGGAGAGCTCATTTGGAATTTTGCCGATTTCATGACTGAACAGTCACCGACGAGAGTG 
CTGGGGAATAAAAAGGGGATCTTCACTCGGCAGAGACAACCAAAAAGTGCAGCGTTCCTTTTGCGAGAG 
AGATACTGGAAGATTGCCAATGAAACCAGGTATCCCCACTCAGTAGCCAAGTCACAATGTTTGGAAAAC 
AGCCCGTTTACTTGAGCAAGACTGATACCACCTGCGTGTCCCTTCCTCCCCGAGTCAGGGCGACTTCCA 
CAGCAGCAGAACAAGTGCCTCCTGGACTGTTCACGGCAGACCAGAACGTTTCTGGCCTGGGTTTTGTGG 
TCATCTATTCTAGCAGGGAACACTAAAGGTGGAAATAAAAGATTTTCTATTATGGAAATAAAGAGTTGG 
CATGAAAGTCGCTACTG 

After running PRIMER3, 100 sets of primers and probes were generated for ACTB and GUSB. From 
this set, nested primers were chosen based on whether both left primers could be paired with both right 
primers and a single Taqman probe could be used on an insert of the correct size. With more 
experience we have decided not use the mix and match approach to primer selection and just use 
several of the top pairs of predicted primers. 

For ACTB this turned out to be: 

Forward 75 C AC AATGTGGCCGAGGACTT( SEQID 3085), 
Forward 80 TGTGGCCGAGGACTTTGATT(SEQID 3086), 
Reverse 178 TGGCTTTTAGGATGGCAAGG(SEQID 3087), and 
Reverse 168 GGGGGCTTAGTTTGCTTCCT(SEQID 3088). 

Upon testing, the F75 and R178 pair worked best. 
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For GUSB the following primers were chosen: 
Forward 59 AAGTGCAGCGTTCCTTTTGQSEQID 3089 ), 
Forward 65 AGCGTTCCTTTTGCGAGAGA (SEQID 3090 ), " 
Reverse 158 CGGGCTGTTTTCCAAACATT (SEQID 3091), and 
Reverse 197 GAAGGGACACGCAGGTGGTA (SEQID 3092). 

No combination of these GUSB pairs worked well. 

In addition to the primer pairs above, Primer Express predicted the following primers for GUSB: 
Forward 178 TACCACCTGCGTGTCCCTTC (SEQID 3093) and Reverse 242 

GAGGCACTTGTTCTGCTGCTG (SEQID 3094). This pair of primers worked to amplify the GUSB 
mRNA. 

The parameters used to predict these primers in Primer Express were: 
Primer Tm: min 58, Max=60, opt 59, max difference^ degrees 
Primer GC: min=20% Max =80% no 3' G/C clamp 
Primer: Length: min=9 max=40 opt=20 
Amplicon: min Tm=0 max Tm— 85 
min = 50 bp max = 150 bp 

Probe: Tm 10 degrees > primers, do not begin with a G on 5* end 

Other: max base pair repeat = 3 

max number of ambiguous residues = 0 

secondary structure: max consecutive bp = 4, max total bp = 8 

Uniqueness: max consecutive match - 9 

max % match = 75 

max 3' consecutive match = 7 



Granzyme B is a marker of transplant rejection. 

For Granzyme B the following sequence (NMJ)04131) (SEQID 3096) was used as input for Primer3 : 

GGGGACTCTGGAGGCCCTCTTGTGTGTAACAAGGTGGCCCAGGGCATTGT 
CTCCTATGGACGAAACAATGGCATGCCTCCACGAGCCTGCACCAAAGTCT 
CAAGCTTTGTACACTGGATAAAGAAAACCATGAAACGCTACTAACTACAG 
GAAGCAAACTAAGCCCCCGCTGTAATGAAACACCTTCTCTGGAGCCAAGT 
CCAGATTTACACTGGGAGAGGTGCCAGCAACTGAATAAATACCTCTCCCA 
GTGTAAATCTGGAGCCAAGTCCAGATTTACACTGGGAGAGGTGCCAGCAA 
CTGAATAAATACCTCTTAGCTGAGTGG 

For Granzyme B the following primers were chosen for testing: 
Forward 81 ACGAGCCTGCACCAAAGTCT (SEQID 3097) 
Forward 63 AAACAATGGCATGCCTCCAC (SEQID 3098) 
Reverse 178 TCATTACAGCGGGGGCTTAG (SEQID 3099) 
Reverse 1 68 GGGGGCTTAGTTTGCTTCCT (SEQID 3 1 00) 

Testing demonstrated that F81 and R178 worked well. 

Using this approach, primers were designed for all the genes that were shown to have expression 
patterns that correlated with allograft rejection. These primer pairs are shown in Table 2, Table 8, and 
are added to the sequence listing. Primers can be designed from any region of a target gene using this 
approach. 

PRIMER ENDPOINT TESTING 



146 



WO 2004/042346 



PCT/US2003/012946 



Primers were first tested to examine whether they would produce the correct size product without non- 
specific amplification. The standard real-time PCR protocol was used without the Rox and Sybr green 
dyes. Each primer pair was tested on cDNA made from universal mononuclear leukocyte reference 
RNA that was produced from 50 individuals as described in Example 3 (R50). 
The PCR reaction consisted of IX RealTime PCR Buffer (Ambion, Austin, TX), 2mM MgC12 
(Applied BioSystems, B02953), 0.2mM dATP (NEB), 0.2mM dTTP (NEB), 0.2mM dCTP (NEB), 
0.2mM dGTP (NEB), .625U AmpliTaq Gold (Applied BioSystems, Foster City, CA), 0.3uM of each 
primer to be used (Sigma Genosys, The Woodlands, TX), 5|il of the R50 reverse-transcription reaction 
and water to a final volume of 19uJ. 

Following 40 cycles of PCR, 10 microliters of each product was combined with Sybr green at a final 
dilution of 1 :72,000. Melt curves for each PCR product were determined on an ABI 7900 (Applied 
BioSystems, Foster City, CA), and primer pairs yielding a product with one clean peak were chosen for 
further analysis. One microliter of the product from these primer pairs was examined by agarose gel 
electrophoresis on an Agilent Bioanalyzer, DNA1000 chip (Palo Alto, CA). Results for 2 genes are 
shown in Figure 9. From the primer design and the sequence of the target gene, one can calculate the 
expected size of the amplified DNA product. Only primer pairs with amplification of the desired 
product and minimal amplification of contaminants were used for real-time PCR. Primers that 
produced multiple products of different sizes are likely not specific for the gene of interest and may 
amplify multiple genes or chromosomal loci. 



PRIMER OPTIMIZATION/EFFICIENCY 



Once primers passed the end-point PCR, the primers were tested to determine the efficiency of the 
reaction in a real-time PCR reaction. cDNA was synthesized from starting total RNA as described 
above. A set of 5 serial dilutions of the R50 reverse-transcribed cDNA (as described above) were made 
in water: 1:10, 1:20, 1:40, 1:80, and 1:160. 

The Sybr Green real-time PCR reaction was performed using the Taqman PCR Reagent kit (Applied 
BioSystems, Foster City, CA, N808-0228). A master mix was made that consisted of all reagents 
except the primes and template. The final concentration of all ingredients in the reaction was IX 
Taqman Buffer A (Applied BioSystems), 2mM MgC12 (Applied BioSystems), 200uM dATP (Applied 
BioSystems), 200|iM dCTP (Applied BioSystems), 200uM dGTP (Applied BioSystems), 400^M 
dUTP (Applied BioSystems), 1:400,000 diluted Sybr Green dye (Molecular Probes), 1.25U AmpliTaq 
Gold (Applied BioSystems). The PCR master mix was dispensed into two, light-tight tubes. Each 0- 
Actin primer F75 and R178 (Sigma-Genosys, The Woodlands, TX), was added to one tube of PCR 
master mix and Each 0-GUS primer F178 and R242 (Sigma-Genosys), was added to the other tube of 
PCR master mix to a final primer concentration of 300nM. 45|il of the P-Actin or 0-GUS master mix 
was dispensed into wells, in a 96- well plate (Applied BioSystems). 5uJ of the template dilution series 
was dispensed into triplicate wells for each primer. The reaction was run on an ABI 7900 Sequence 
Detection System (Applied BioSystems) with the following conditions: 10 min. at 95°C; 40 cycles of 
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95°C for 15 sec, 60°C for 1 min; followed by a disassociation curve starting at 50°C and ending at 
95°C . 

The Sequence Detection System v2.0 software was used to analyze the fluorescent signal from each 
well. The high end of the baseline was adjusted to between 8 and 20 cycles to reduce the impact on 
any data curves, yet be as high as possible to reduce baseline drift. A threshold value was selected that 
allowed the majority of the amplification curves to cross the threshold during the linear phase of 
amplification. The disassociation curve for each well was compared to other wells for that marker. 
This comparison allowed identification of "bad" wells, those that did not amplify, that amplified the 
wrong size product, or that amplified multiple products. The cycle number at which each amplification 
curve crossed the threshold (C T ) was recorded and the file transferred to MS Excel for further analysis. 
The C T values for triplicate wells were averaged. The data were plotted as a function of the log 10 of the 
calculated starting concentration of RNA. The starting RNA concentration for each cDNA dilution 
was determined based on the original amount of RNA used in the RT reaction, the dilution of the RT 
reaction, and the amount used (5 ui) in the real-time PCR reaction. For each gene, a linear regression 
line was plotted through all of the dilutions series points. The slope of the line was used to calculate 
the efficiency of the reaction for each primer set using the equation: 



Using this equation (Pfaffl 2001, Applied Biosystems User Bulletin #2), the efficiency for these P-actin 
primers is 1.28 and the efficiency for these P-GUS primers is 1.14 (Figure 10). This efficiency was 
used when comparing the expression levels among multiple genes and multiple samples. This same 
method was used to calculate reaction efficiency for primer pairs for each gene studied. A primer pair 
was considered successful if the efficiency was reproducibly determined to be between 0.7 and 2.4. 
SYBR-GREEN ASSAYS 

Once markers passed the Primer Efficiency QPCR (as stated above), they were used in real-time PCR 
assays. Patient RNA samples were reverse-transcribed to cDNA (as described above) and 1:10 
dilutions made in water. In addition to the patient samples, a no template control (NTC) and a pooled 
reference RNA (see example 3) described in were included on every plate. 
The Sybr Green real-time PCR reaction was performed using the Taqman Core PCR Reagent kit 
(Applied BioSystems, Foster City, CA, N808-0228). A master mix was made that consisted of all 
reagents except the primers and template. The final concentration of all ingredients in the reaction was 
IX Taqman Buffer A (Applied BioSystems), 2mM MgC12 (Applied BioSystems), 200uM dATP 
(Applied BioSystems), 200^M dCTP (Applied BioSystems), 200uM dGTP (Applied BioSystems), 
400uM dUTP (Applied BioSystems), 1:400,000 diluted Sybr Green dye (Molecular Probes), 1.25U 
AmpliTaq Gold (Applied BioSystems). The PCR master mix was aliquotted into eight light-tight 
tubes, one for each marker to be examined across a set of samples. The optimized primer pair for each 
marker was then added to the PCR master mix to a final primer concentration of 300nM. 1 8ul of the 
each marker master mix was dispensed into wells in a 384well plate (Applied BioSystems). 2ul of the 
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1:10 diluted control or patient cDNA sample was dispensed into triplicate wells for each primer pair. 
The reaction was run on an ABI 7900 Sequence Detection System (Applied BioSystems) using the 
cycling conditions described above. 

The Sequence Detection System v2.0 software (Applied BioSystems) was used to analyze the 
fluorescent signal from each well. The high end of the baseline was adjusted to between 8 and 20 
cycles to reduce the impact on any data curves, yet be as high as possible to reduce baseline drift. A 
threshold value was selected that allowed the majority of the amplification curves to cross the threshold 
during the linear phase of amplification. The disassociation curve for each well was compared to other 
wells for that marker. This comparison allowed identification of "bad" wells, those that did not 
amplify, that amplified the wrong size product, or that amplified multiple products. The cycle number 
at which each amplification curve crossed the threshold (C T ) was recorded and the file transferred to 
MS Excel for further analysis. The C T value representing any well identified as bad by analysis of 
disassociation curves was deleted. The C T values for triplicate wells were averaged. A standard 
deviation (Stdev) and a coefficient of variation (CV) were calculated for the triplicate wells. If the CV 
was greater than 2, an outlier among the three wells was identified and deleted. Then the average was 
re-calculated. In each plate, AC T was calculated for each marker-control combination by subtracting 
the average C T of the target marker from the average C T of the control (/3-Actin or 0-GUS). The 
expression relative to the control marker was calculated by taking two to the power of the AC T of the 
target marker. For example, expression relative to £-Actin was calculated by the equation: 



All plates were run in duplicate and analyzed in the same manner. The percent variation was 
determined for each sample-marker combination (relative expression) by taking the absolute value of 
the value of the RE for the second plate from the RE for the first plate, and dividing that by the 
average. If more than 25% of the variation calculations on a plate are greater than 50%, then a third 
plate was run. 

TAOMAN PROTOCOL 

Real-time PCR assays were also done using Taqman PCR chemistry. 

The Taqman real-time PCR reaction was performed using the Taqman Universal PCR Master Mix 
(Applied BioSystems, Foster City, CA, #4324018). The master mix was aliquoted into eight, light- 
tight tubes, one for each marker. The optimized primer pair for each marker was then added to the 
correctly labeled tube of PCR master mix. A FAM/TAMRA dual-labeled Taqman probe (Biosearch 
Technologies, Navoto, CA, DLO-FT-2) was then added to the correctly labeled tube of PCR master 
mix. Alternatively, different combinations of fluorescent reporter dyes and quenchers can be used such 
that the absorption wavelength for the quencher matches the emission wavelength for the reporter, as 
shown in Table 5. 18ul of the each marker master mix was dispensed into a 3 84 well plate (Applied 
BioSystems). 2ul of the template sample was dispensed into triplicate wells for each primer pair. The 
final concentration of each reagent was: IX TaqMan Universal PCR Master Mix, 300nM each primer, 
0.25nM probe, 2ul 1:10 diluted template. The reaction was run on an ABI 7900 Sequence Detection 
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System (Applied Biosystems) using standard conditions (95°C for 10 min., 40 cycles of 95°C for 15 
sec, 60°C for 1 min.). 

The Sequence Detector v2.0 software (Applied BioSystems) was used to analyze the fluorescent signal 
from each well. The high end of the baseline was adjusted to between 8 and 20 cycles to reduce the 
impact on any data curves, yet be as high as possible to reduce baseline drift. A threshold value was 
selected that allowed most of the amplification curves to cross the threshold during the linear phase of 
amplification. The cycle number at which each amplification curve crossed the threshold (C T ) was 
recorded and the file transferred to MS Excel for further analysis. The C T values for triplicate wells 
were averaged. The C T values for triplicate wells were averaged. A standard deviation (Stdev) and a 
coefficient of variation (CV) were calculated for the triplicate wells. If the CV was greater than 2, an 
outlier among the three wells was identified and deleted. Then the average was re-calculated. In each 
plate, AC T was calculated for each marker-control combination by subtracting the average C T of the 
target marker from the average C T of the control (/5-Actin or 0-GUS). The expression relative to the 
control marker was calculated by taking two to the power of the AC T of the target marker. All plates 
were run in duplicate and analyzed in the same manner. The percent variation was determined for each 
sample-marker combination (relative expression) by taking the absolute value of the value of the RE 
for the second plate from the RE for the first plate, and dividing that by the average. If more than 25% 
of the variation calculations on a plate are greater than 50%, then a third plate was run. 

BI-PLEXING 

Variation of real-time PCR assays can arise from unequal amounts of RNA starting material between 
reactions. In some assays, to reduce variation, the control gene amplification was included in the same 
reaction well as the target gene. To differentiate the signal from the two genes, different fluorescent 
dyes were used for the control gene. /3-Actin was used as the control gene and the TaqMan probe used 
was labeled with the fluorescent dye VIC and the quencher TAMRA (Biosearch Technologies, Navoto, 
CA, DLO-FT-2). Alternatively, other combinations of fluorescent reporter dyes and quenchers (Table 
5) can be used as long as the emission wavelength of the reporter for the control gene is sufficiently 
different from the wavelength of the reporter dye used for the target. The control gene primers and 
probe were used at limiting concentrations in the reaction (150 nM primers and 0.125 nM probe) to 
ensure that there were enough reagents to amplify the target marker. The plates were run under the 
same protocol and the data are analyzed in the same way, but with a separate baseline and threshold for 
the VIC signal. Outliers were removed as above from both the FAM and VIC signal channels. The 
expression relative to control was calculated as above, using the VIC signal from the control gene. 

ABSOLUTE QUANTITATION 

Instead of calculating the expression relative to a reference marker, an absolute quantitation can be 
performed using real-time PCR. To determine the absolute quantity of each marker, a standard curve is 
constructed using serial dilutions from a known amount of template for each marker on the plate. The 
standard curve may be made using cloned genes purified from bacteria or using synthetic 
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complimentary oligonucleotides. In either case, a dilution series that covers the expected range of 
expression is used as template in a series of wells in the plate. From the average C T values for these 
known amounts of template a standard curve can be plotted. From this curve the C T values for the 
unknowns are used to identify the starting concentration of cDNA. These absolute quantities can be 
compared between disease classes (i.e. rejection vs. no-rejection) or can be taken as expression relative 
to a control gene to correct for variation among samples in sample collection, RNA purification and 
quantification, cDNA synthesis, and the PCR amplification. 

CELL TYPE SPECIFIC EXPRESSION 

Some markers are expressed only in specific types of cells. These markers maybe useful markers for. 
differentiation of rejection samples from no-rejection samples or may be used to identify differential 
expression of other markers in a single cell type. A specific marker for cytotoxic T-lymphocytes (such 
as CD8) can be used to identify differences in cell proportions in the sample. Other markers that are 
known to be expressed in this cell type can be compared to the level of CD8 to indicate differential 
gene expression within CD8 T-cells. 

Control genes for PCR 

As discussed above, PCR expression measurements can be made as either absolute quantification of 
gene expression using a standard curve or relative expression of a gene of interest compared to a 
control gene. In the latter case, the gene of interest and the control gene are measured in the same 
sample. This can be done in separate reactions or in the same reaction (biplex format, see above). In 
either case, the final measurement for expression of a gene is expressed as a ratio of gene expression to 
control gene expression. It is important for a control gene to be constitutively expressed in the target 
tissue of interest and have minimal variation in expression on a per cell basis between individuals or 
between samples derived from an individual. If the gene has this type of expression behavior, the 
relative expression ratio will help correct for variability in the amount of sample RNA used in an assay. 
In addition, an ideal control gene has a high level of expression in the sample of interest compared to 
the genes being assayed. This is important if the gene of interest and control gene are used in a biplex 
format. The assay is set up so that the control gene reaches its threshold Ct value early and its 
amplification is limited by primers so that it does not compete for limiting reagents with the gene of 
interest. 

To identify an ideal control gene for an assay, a number of genes were tested for variability between 
samples and expression in both mononuclear RNA samples and whole blood RNA samples using the 
RNA procurement and preparation methods and real-time PCR assays described above. 6 whole-blood 
and 6 mononuclear RNA samples from transplant recipients were tested. The intensity levels and 
variability of each gene in duplicate experiments on both sample types are shown in Figure 1 1. 
Based on criteria of low variability and high expression across samples, P-actin, 18s, GAPDH, 
b2microglobulin were found to be good examples of control genes for the PAX samples. A single 
control gene may be incorporated as an internal biplex control is assays. 
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Controlling for variation in real time PCR 

Due to differences in reagents, experimenters, and preparation methods, and the variability of pipetting 
steps, there is significant plate-to-plate variation in real-time PCR experiments. This variation can be 
reduced by automation (to reduce variability and error), reagent lot quality control, and optimal data 
handling. However, the results on replicate plates are still likely to be different since they are run in the 
machine at different times. 

Variation can also enter in data extraction and analysis. Real-time PCR results are measured as the 
time (measured in PCR cycles) at which the fluorescence intensity (DRn in Applied Biosystems SDS 
v2.1 software) crosses a user-determined threshold (CT). When performing relative quantification, the 
CT value for the target gene is subtracted from the CT value for a control gene. This difference, called 
ACT, is the value compared among experiments to determine whether there is a difference between 
samples. Variation in setting the threshold can introduce additional error. This is especially true in the 
duplexed experimental format, where both the target gene and the control gene are measured in the 
same reaction tube. Duplexing is performed using dyes specific to each of the two genes. Since two 
different fluorescent dyes are used on the plate, two different thresholds are set. Both of these 
thresholds contribute to each ACT. Slight differences in the each dye's threshold settings (relative to 
the other dye) from one plate to the next can have significant effects on the ACT. 
There are several methods for setting the threshold for a PCR plate. Older versions of SDS software 
(Applied Biosystems) deterrriine the average baseline fluorescence for the plate and the standard 
deviation of the baseline. The threshold is set to lOx the standard deviation of the baseline. In SDS 2.0 
the users must set the baseline by themselves. Software from other machine manufacturers either 
requires the user to set the threshold themselves or uses different algorithms. The latest version of the 
SDS software (SDS 2.1) contains Automatic baseline and threshold setting. The software sets the 
baseline separately for each well on the plate using the ARn at cycles preceding detectable levels. 
Variability among plates is dependent on reproducible threshold setting. This requires a mathematical 
or experimental data driven threshold setting protocol. Reproducibly setting the threshold according to 
a standard formula will minimize variation that might be introduced in the threshold setting process. 
Additionally, there may be experimental variation among plates that can be reduced by setting the 
threshold to a component of the data. We have developed a system that uses a set of reactions on each 
plate that are called the threshold calibrator (TCb). The TCb wells are used to set the threshold on all 
plates. 

1. The TCb wells contain a template, primers, and probes that are common among all plates within an 
experiment. 

2. The threshold is set within the minimum threshold and maximum threshold determined above. 

3. The threshold is set to a value in this range that results in the average CT value for the TCb wells to 
be the same on all plates. 

These methods were used to derive the primers depicted in Table 2C. 
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Example 13: Real-time PCR expression markers of acute allog raft rejection 

In examples 14 and 16, genes were identified as useful markers of cardiac and renal allograft rejection 
using microarrays. Some genes identified through these studies are listed in Table 2. In order to 
validate these findings, obtain a more precise measurement of expression levels and develop PCR 
reagents for diagnostic testing, real-time PCR assays were performed on samples from allograft 
recipients using primers to the identified genes. Some gene specific PCR primers were developed and 
tested for all genes in Table 2A as described in example 12. Some primers are listed in Table 2C and 
the sequence listing. These primers were used to measure expression of the genes relative to P-actin or 
P-gus in 69 mononuclear RNA samples obtained from cardiac allograft recipients using Sybr green 
real-time PCR assays as described in example 12. Each sample was associated with an ISHLT cardiac 
rejection biopsy grade. The samples were tested in 2 phases. In phase I, 14 Grade 0, 1 Grade 1 A, 3 
Grade 2 and 9 Grade 3 A samples were tested. In phase II, 19 Grade 2, 4 Grade IB, 4 Grade 2 and 15 
Grade 3 A samples were tested. Data was analyzed for each phase individually and for the combined 
phase I + II sample set. These data are summarized in Table 6. 

The average fold change in expression between rejection (3A) and no rejection (0) samples was 
calculated. A t-test was done to determine the significance with which each gene was differentially 
expressed between rejection and no rejection and a p-value was calculated. Genes with high average 
fold changes and low p-values are considered best candidates for further development as rejection 
markers. However, it is important to note that a gene with a low average fold change and a high p- 
value may still be a useful marker for rejection in some patients and may work as part of a gene 
expression panel to diagnose rejection. These same PCR data were used to create PCR gene expression 
panels for diagnosis of acute rejection as discussed in example 17. 

Non-parametric tests such as the Fisher Exact Test and Mann- Whitney U test are useful for choosing 
useful markers. They assess the ability of markers to discrininate between different classes as well as 
their significance. For example, one could use the median of all samples (including both non-rejector 
and rejector samples) as a threshold and apply the Fisher Exact test to the numbers of rejectors and 
non-rejectors above and below the threshold. 
These methods were used to generate the data in Table 2D. 



Example 14: Identification of diagnostic nucleotide sets for diagnosis of Card iac Allograft Rejection 
using microarrays 

Genes were identified which have expression patterns useful for the diagnosis and monitoring of acute 
cardiac allograft rejection. Further, sets of genes that work together in a diagnostic algorithm for 
allograft rejection were identified. Acute allograft rejection is a process that occurs in all solid organ 
transplantation including, heart, lung, liver, kidney, pancreas, pancreatic islet cell, intestine and others. 
Gene expression markers of acute cardiac rejection may be useful for diagnosis and monitoring of all 
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allograft recipients. Patients, patient clinical data and patient samples used in the discovery of markers 
below were derived from a clinical study described in example 5. 

The collected clinical data was used to define patient or sample groups for correlation of expression 
data. Patient groups were identified for comparison. For example, a patient group that possesses a 
useful or interesting clinical distinction, verses a patient group that does not possess the distinction. 
Measures of cardiac allograft rejection were derived from the clinical data to divide patients (and 
patient samples) into groups with higher and lower rejection activity over some period of time or at any 
one point in time. Such data were rejection grades as determined from histological reading of the 
cardiac biopsy specimens by a pathologist and data measuring progression of end-organ damage, 
including depressed left ventricular dysfunction (decreased cardiac output, decreased ejection fraction, 
clinical signs of low cardiac output) and usage of inotropic agents (Kobashigawa 1998). 
Mononuclear RNA samples were collected and prepared from patients who had recently undergone a 
cardiac allograft transplantation using the protocol described in example 2. The allograft rejection 
status at the time of sample collection was determined by examination of cardiac biopsies as described 
in example 5 and as summarized here. 

300 patient samples were included in the analysis. Each patient sample was associated with a biopsy 
and other clinical data collected at the time of the sample. The cardiac biopsies were graded by a 
pathologist at the local center and by three centralized pathologists who read the biopsy slides from all 
four local centers in a blinded manner. Biopsy grades included 0, 1A, IB, 2, 3A, and 3B. No grade 4 
rejection was identified. Dependent variables were developed based on these grades using the local 
center pathology reading, the reading of a centralized and blinded pathologist, the highest of the 
readings, local or centralized and a consensus grade derived from all pathological readings. Samples 
were classified as no rejection or rejection in the following ways: Grade 0 vs. Grades 1-4, Grades 0 and 
1 A vs. Grades 1B-4, Grade 0 vs. Grade 3A, Grade 0 vs. Grades 1B-4, and Grade 0 vs. Grades IB and 
3A-4. Grade 0 samples were selected such that they were not immediately followed by an episode of 
acute rejection in the same patient. Comparing Grade 0 samples to Grade 3A samples gives the 
greatest difference between the rejection and no rejection groups on average. 
Taking the highest of all pathologist readings has the effect of removing any sample from the no 
rejection class that was not a unanimous Grade 0. It also results in an increase in the number of 
rejection samples used in an analysis with the assumption that if a pathologist saw features of rejection, 
the call was likely correct and the other pathologists may have missed the finding. Many leading 
cardiac pathologists and clinicians believe that ISHLT grade 2 rejection does not represent significant 
acute rejection. Thus, for correlation analysis, exclusion of Grade 2 samples may be warranted. 
Clinical data were also used to determine criteria for including samples in the analysis. For example, a 
patient with an active infection or in the early post-transplant period (ongoing surgical inflammation) 
might have immune activation unrelated to rejection and thus be difficult to identify as patients without 
rejection. The strictest inclusion criteria required that samples be from patients who did not have a 
bacterial or viral infection, were at least two weeks post cardiac transplant, were asymptomatic and 
were not currently admitted to the hospital. 
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After preparation of RNA (example 2), amplification, labeling, hybridization, scanning, feature 
extraction and data processing were done as. described in Example 1 1, using the oligonucleotide 
microarrays described in Example 9. The resulting log ratio of expression of Cy3 (patient sample)/ 
Cy5 (R50 reference RNA) was used for analysis. 

Significance analysis for microarrays (SAM, Tusher 2001, Example 15) was used to discover genes 
that were differentially expressed between the rejection and no-rejection groups. Many different 
combinations of dependent variables, inclusion criteria, static/referenced, and data subsets were used in 
SAM analysis to develop the primary lists of genes significantly differentially expressed between 
rejection and no-rejection. As described in example 1 5, SAM assigns a false detection rate to each 
gene identified as differentially expressed. The most significant of these genes were identified. 
An exemplary analysis was the comparison of Grade 0 samples to Grade 3A-4 samples using SAM. 
Data from the all the pathological readings was used to identify consensus Grade 0 samples and 
samples with at least one reading of Grade 3 A or above. Using this definition of rejection and no 
rejection, expression profiles from rejection samples were compared to no rejection samples using 
SAM. The analysis identified 7 genes with a FDR of 1%, 15 genes @ 1.4%, 35 genes @ 3.9%. Many 
more genes were identified at higher FDR levels. 

In Table 7, a number of SAM analyses are summarized. In each case the highest grade from the 3 
pathologists was taken for analysis. No rejection and rejection classes are defined. Samples are either 
used regardless of redundancy with respect to patients or a requirement is made that only one sample is 
used per patient or per patient per class. The number of samples used in the analysis is given and the 
lowest FDR achieved is noted. 

Some of the genes identified by SAM as candidate rejection markers are noted in Table 2 A and B. 
SAM chooses genes as significantly different based on the magnitude of the difference between the 
groups and the variation among the samples within each group. It is important to note that a gene 
which is not identified by SAM as differentially expressed between rejection and no rejection may still 
be a useful rejection marker because: 1. The microarray technology is not adequately sensitive to detect 
all genes expressed at low levels. 2. A gene might be a useful member of a gene expression panel in 
that it is a useful rejection marker only in a subset of patients. This gene may not be significantly 
differentially expressed between all rejection and no rejection samples. 

For the purposes of cross-validation of the results, the datasets were also divided into subsets to 
compare analysis between two subsets of roughly half of the data. The types of subsets constructed 
were as follows. First half/second half subsets were the first half of the samples and the second half of 
the samples from a dataset ordered by sample number. Odd/even subsets used the same source, a 
dataset ordered by sample number, but the odd subset consisted of every 2 nd sample starting with the 
first and the even subset consisted of every 2 nd sample starting with the second sample, Center 14/other 
subsets were the same datasets, divided by transplant hospital. The center 14 subset consisted of all 
samples from patients at center 14, while the other subset consisted of all samples from the other three 
centers (12,13, and 15). When a gene was found to be significantly differentially expressed in both sets 
of data, a higher priority was put on that gene for development of a diagnostic test. This was reflected 
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in a "Array Score" value (Table 2B) that also considered the false detection rate for the gene and the 
importance of the gene in classification models (see example 17). 

Alternatively one can divide samples into 10 equal parts and do 10-fold cross validation of the results 
of SAM. 

Microarray data was also used to generate classification models for diagnosis of rejection as described 
in example 17. Genes identified through classification models as useful in the diagnosis of rejection 
are noted in in Table 2B in the column "models'*. 

As genes were identified as useful rejection markers by microarray significance analysis, classification 
models, PCR analysis, or through searching the prior art, a variety of approaches were employed to 
discover genes that had similar expression behavior (coexpression) to the gene of interest. If a gene is 
a useful rejection marker, then a gene that is identified as having similar expression behavior is also 
likely to be a useful rejection marker. Hierarchical clustering (Eisen et al. 1998, see example 15) was 
used to identify co-expressed genes for established rejection markers. Genes were identified from the 
nearest branches of the clustering dendrogram Gene expression profiles generated from 240 samples 
derived from transplant recipients were generated as described above. Hierarchical clustering was 
performed and co-expressed genes of rejection markers were identified. An example is shown in 
Figure 12. SEQ ID NO:85 was shown to be significantly differentially expressed between rejection 
and no rejection using both microarrays and PCR. Gene SEQ ID NO:3020 was identified by 
hierarchical clustering as closely co-expressed with SEQ ID NO:85. In table 2B, genes identified as 
co-expressed with established markers are identified as such by listing the SEQ ID that they are co- 
expressed with in the column labeled "clusters". 

Some of the primers for real-time PCR validation were designed for each of the marker genes as 
described in Example 12 and are listed in Table 2C and the sequence listing. PCR expression 
measurements using these primers were used to validate array findings, more accurately measure 
differential gene expression and create PCR gene expression panels for diagnosis of rejection as 
described in example 17. 

Alternative methods of analyzing the data may involve 1) using the sample channel without 
normalization by the reference channel, 2) using an intensity-dependent normalization based on the 
reference which provides a greater correction when the signal in the reference channel is large, 3) using 
the data without background subtraction or subtracting an empirically derived function of the 
background intensity rather than the background itself. 
These methods were used to identify genes listed in Table 2B. 
Example 15: Correlation and Classification Analysis 

After generation and processing of expression data sets from microarrays as described in Example 1 1 , a 
log ratio value is used for most subsequent analysis. This is the logarithm of the expression ratio for 
each gene between sample and universal reference. The processing algorithm assigns a number of 
flags to data that are of low signal to noise, saturated signal or are in some other way of low or 
uncertain quality. Correlation analysis can proceed with all the data (including the flagged data) or can 
be done on filtered data sets where the flagged data is removed from the set. Filtered data should have 
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less variability and noise and may result in more significant or predictive results. Flagged data contains 
all information available and may allow discovery of genes that are missed with the filtered data set. 
After filtering the data for quality as described above and in example 1 1, missing data are common in 
microarray data sets. Some algorithms don't require complete data sets and can thus tolerate missing 
values. Other algorithms are optimal with or require imputed values for missing data. Analysis of data 
sets with missing values can proceed by filtering all genes from the analysis that have more than 5%, 
10%, 20%, 40%, 50%, 60% or other % of values missing across all samples in the analysis. Imputation 
of data for missing values can be done by a variety of methods such as using the row mean, the column 
mean, the nearest neighbor or some other calculated number. Except when noted, default settings for 
filtering and imputation were used to prepare the data for all analytical software packages. 
In addition to expression data, clinical data are included in the analysis. Continuous variables, such as 
the ejection fraction of the heart measured by echocardiography or the white blood cell count can be 
used for correlation analysis. Any piece of clinical data collected on study subjects can be used in a 
correlation or classification analysis. In some cases, it may be desirable to take the logarithm of the 
values before analysis. These variables can be included in an analysis along with gene expression 
values, in which case they are treated as another "gene". Sets of markers can be discovered that work 
to diagnose a patient condition and these can include both genes and clinical parameters. Categorical 
variables such as male or female can also be used as variables for correlation analysis. For example, 
the sex of a patient may be an important splitter for a classification tree. 

Clinical data are used as supervising vectors (dependent variables) for the significance or classification 
analysis of expression data. In this case, clinical data associated with the samples are used to divide 
samples in to clinically meaningful diagnostic categories for correlation or classification analysis. For 
example, pathologic specimens from kidney biopsies can be used to divide lupus patients into groups 
with and without kidney disease. A third or more categories can also be included (for example 
"unknown" or "not reported"). After generation of expression data and definition of supervising 
vectors, correlation, significance and classification analysis are used to determine which set of genes 
and set of genes are most appropriate for diagnosis and classification of patients and patient samples. 
Two main types of expression data analyses are commonly performed on the expression data with 
differing results and purposes. The first is significance analyses or analyses of difference. In this case, 
the goal of the analysis is to identify genes that are differentially expressed between sample groups and 
to assign a statistical confidence to those genes that are identified. These genes may be markers of the 
disease process in question and are further studied and developed as diagnostic tools for the indication. 
The second major type of analysis is classification analysis. While significance analysis identifies 
individual genes that are differentially expressed between sample groups, classification analysis 
identifies gene sets and an algorithm for their gene expression values that best distinguish sample 
(patient) groups. The resulting gene expression panel and algorithm can be used to create and 
implement a diagnostic test. The set of genes and the algorithm for their use as a diagnostic tool are 
often referred to herein as a "model". Individual markers can also be used to create a gene expression 
diagnostic model. However, multiple genes (or gene sets) are often more useful and accurate 
diagnostic tools. 
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Significance analysis for microarravs (SAN/Q 

Significance analysis fo/ microarrays (SAM) (Jusher 2001) is a method through which genes with a 
correlation between their expression values and the response vector are statistically discovered and 
assigned a statistical significance. The ratio of false significant to significant genes is the False 
Discovery Rate (FDR). This means that for each threshold there are some number of genes that are 
called significant, and the FDR gives a confidence level for this claim. If a gene is called differentially 
expressed between two classes by SAM, with a FDR of 5%, there is a 95% chance that the gene is 
actually differentially expressed between the classes. SAM will identify genes that are differentially 
expressed between the classes. The algorithm selects genes with low variance within a class and large 
variance between classes. The algorithm may not identify genes that are useful in classification, but are 
not differentially expressed in many of the samples. For example, a gene that is a useful marker for 
disease in women and not men, may not be a highly significant marker in a SAM analysis, but may be 
useful as part of a gene set for diagnosis of a multi-gene algorithm. 

After generation of data from patient samples and definition of categories using clinical data as 
supervising vectors, SAM is used to detect genes that are likely to be differentially expressed between 
the groupings. Those genes with the highest significance can be validated by real-time PCR (Example 
1 3) or can be used to build a classification algorithm as described here. 
Classification 

Classification algorithms are used to identify sets of genes and formulas for the expression levels of 
those genes that can be applied as diagnostic and disease monitoring tests. The same classification 
algorithms can be applied to all types of expression and proteomic data, including microarray and PCR 
based expression data. Examples of classification models are given in example 17. The discussion 
below describes the algorithms that were used and how they were used. 

Classification and Regression Trees (CART) is a decision tree classification algorithm (Breiman 1984). 
From gene expression and or other data, CART can develop a decision tree for the classification of 
samples. Each node on the decision tree involves a query about the expression level of one or more 
genes or variables. Samples that are above the threshold go down one branch of the decision tree and 
samples that are not go down the other branch. Genes from expression data sets can be selected for 
classification building with CART by significant differential expression in SAM analysis (or other 
significance test), identification by supervised tree-harvesting analysis, high fold change between 
sample groups, or known relevance to classification of the target diseases. In addition, clinical data can 
be used as independent variables for CART that are of known importance to the clinical question or are 
found to be significant predictors by multivariate analysis or some other technique. CART identifies 
predictive variables and their associated decision rules for classification (diagnosis). CART also 
identifies surrogates for each splitter (genes that are the next best substitute for a useful gene in 
classification). Analysis is performed in CART by weighting misclassification costs to optimize 
desired performance of the assay. For example, it may be most important that the sensitivity of a test 
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for a given diagnosis be > 90%. CART models can be built and tested using 10 fold cross-validation or 
v-fold cross validation (see below). CART works best with a smaller number of variables (5-50). 
Multiple Additive Regression Trees (Friedman, JH 1999, MART) is similar to CART in that it is a 
classification algorithm that builds decision trees to distinguish groups. MART builds numerous trees 
for any classification problem and the resulting model involves a combination of the multiple trees. 
MART can select variables as it build models and thus can be used on large data sets, such as those 
derived from an 8000 gene microarray. Because MART uses a combination of many trees and does not 
take too much information from any one tree, it resists over training. MART identifies a set of genes 
and an algorithm for their use as a classifier. 

A Nearest Shrunken Centroids Classifier can be applied to microarray or other data sets by the methods 
described by Tibshirani et al. 2002. This algorithms also identified gene sets for classification and 
determines their 10 fold cross validation error rates for each class of samples. The algorithm 
determines the error rates for models of any size, from one gene to all genes in the set. The error rates 
for either or both sample classes can are minimized when a particular number of genes are used. When 
this gene number is deterrnined, the algorithm associated with the selected genes can be identified and 
employed as a classifier on prospective sample. 

For each classification algorithm and for significance analysis, gene sets and diagnostic algorithms that 
are built are tested by cross validation and prospective validation. Validation of the algorithm by these 
means yields an estimate of the predictive value of the algorithm on the target population. There are 
many approaches, including a 10 fold cross validation analysis in which 10% of the training samples 
are left out of the analysis and the classification algorithm is built with the remaining 90%. The 10% 
are then used as a test set for the algorithm. The process is repeated 10 times with 10% of the samples 
being left out as a test set each time. Through this analysis, one can derive a cross validation error 
which helps estimate the robustness of the algorithm for use on prospective (test) samples. Any % of 
the samples can be left out for cross validation (v-fold cross validation, LOOCV). When a gene set is 
established for a diagnosis with an acceptable cross validation error, this set of genes is tested using 
samples that were not included in the initial analysis (test samples). These samples may be taken from 
archives generated during the clinical study. Alternatively, a new prospective clinical study can be 
initiated, where samples are obtained and the gene set is used to predict patient diagnoses. 
Example 16: Acute allograft rejection: biopsy tissue gene expression profiling 

Acute allograft rejection involves activation of recipient leukocytes and infiltration into the 
rejecting organ. For example, CD8 T-cells are activated by CD4 T-cells and enter the allograft where 
they destroy graft tissue. These activated, graft-associated leukocytes may reside in the graft, die or 
exit the graft. Upon exiting, the cells can find their way into the urine or blood (in the case of renal 
allografts), bile or blood (liver allografts) or blood (cardiac allografts). These activated cells have 
specific gene expression patterns that can be measured using microarrays, PCR or other methods. 
These gene expression patterns can be measured in the graft tissue (graft associated leukocytes), blood 
leukocytes, urine leukocytes or stool/biliary leukocytes. Thus graft associated leukocyte gene 
expression patterns are used to discover markers of activated leukocytes that can be measured outside 
the graft for diagnostic testing. 



159 



WO 2004/042346 



PCT/US2003/012946 



Renal biopsy and cardiac biopsy tissue specimens were obtained for gene expression profiling. 
The specimens were obtained at the time of allograft biopsy and were preserved by flash freezing in 
liquid nitrogen using standard approaches or immersion in an RNA stablization reagent as per the 
manufacturers recommendation (RNAlater, Qiagen, Valencia, CA). Biopsy allograft pathological 
evaluation was also obtained and samples were classified as having a particular ISHLT rejection grade 
(for cardiac) or acute rejection, chronic rejection, acute tubular necrosis or no disease (for renal). 

28 renal biopsy tissue samples were transferred to RLT buffer, homogenized and RNA was 
prepared using RNeasy preparation kits (Qiagen, Valencia, CA). Average total RNA yield was 1.3 ug. 
Samples were subjected to on column DNAse digestion. 18 samples were derived from patients with 
ongoing acute allograft rejection and 10 were from controls with chronic rejection or acute renal 
failure. 

RNA from the samples was used for amplification, labeling and hybridization to leukocyte arrays 
(example 1 1). Significance analysis for microarrays (SAM, Tusher 2001, Example 15) was used to 
identify genes that were differentially expressed between the acute rejection samples and controls. 
Leukocyte markers of acute rejection that are associated with the graft should be genes that are 
expressed at some level in activated leukocytes. Since leukocytes appear in graft tissue with some 
frequency with acute rejection, leukocyte genes associate with rejection are identified by SAM as 
upregulated in acute rejection in this experiment.' 35 genes were identified as upregulated in acute 
rejection by SAM with less than a 5% false detection rate and 139 were detected with < 10.0% FDR. 
Results of this analysis are shown in Table 8. 

For each of these genes, to 50mer oligonucleotide sequence was used to search NCBI databases 
including Unigene and OMIM. Genes were identified by sequence analysis to be either known 
leukocyte specific markers, known leukocyte expressed markers, known not to be leukocyte expressed 
or expression unknown. This information helped selected candidate leukocyte markers from all 
upregulated genes. This is necessary because some of the upregulated genes may have been expressed 
by renal tissue. Those genes that are leukocyte specific or leukocyte expressed were selected for 
evaluation by PCR in urine and blood samples from patients with and without acute allograft rejection 
(cardiac and renal). These genes are useful expression markers of acute rejection in allograft tissue 
specimens and may also be useful gene expression markers for the process in circulating leukocytes, or 
urine leukocytes. Genes with known leukocyte expression are noted in Table 8. In addition, some of 
the leukocyte expressed genes from this analysis were selected for PCR validation and development for 
diagnosis of acute cardiac rejection and are noted in Table 2. 

Five cardiac rejection markers in the peripheral blood were assayed using real-time PCR in renal 
biopsy specimens. The average fold change for these genes between acute rejection (n = 6) and 
controls (n = 6) is given below. Work is ongoing to increase the number of samples tested and the 
significance of the results. 

PCR assays of cardiac rejection peripheral blood markers in renal allograft tissue. R = rejection, 
NR = No rejection. 
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Gene 


Fold chanqe (R/NR) 


Granzyme B 


2.16 


CD20 


1.42 


NK cell receptor 


1.72 


T-box 21 


1.74 


IL4 


1.3 



Markers of renal rejection that are secreted from cells may be measured in the urine or serum of 
patients as a diagnostic or screening assay for rejection. Genes with lower molecular weight are most 
likely to be filtered into the urine to be measured in this way. Standard immunoassays may be used to 
measure these proteins. In table 8, genes that are known to be secreted are noted. 

Example 17: Microarrav and PCR eene expression panels for diagnosis and m onitoring of acute 
allograft rejection 

Array panels / classification models 

Using the methods of the invention, gene expression panels were discovered for screening and 
diagnosis of acute allograft rejection. Gene expression panels can be implemented for diagnostic 
testing using any one of a variety of technologies, including, but not limited to, microarrays and real- 
time PCR. 

Using peripheral blood mononuclear cell RNA that was collected and prepared from cardiac 
allograft recipients as described in examples 2 and 5, leukocyte gene expression profiles were 
generated and analyzed using microarrays as described in examples 11,13, and 15. 300 samples were 
analyzed. ISHLT rejection grades were used to divide patients into classes of rejection and no 
rejection. Multiple Additive Regression Trees (MART, Friedman, JH 1999, example 15) was used to 
build a gene expression panel and algorithm for the diagnosis of rejection with high sensitivity. 
Default settings for the implementaion of MART called TreeNet 1 .0 (Salford Systems, San Diego, CA) 
were used except where noted. 

82 Grade 0 (rejection) samples and 76 Grade 1B-4 (no rejection) samples were divided into 
training (80% of each class) and testing (20% of each class) sets. A MART algorithm was then 
developed on the training set to distinguish rejection from no rejection samples using a cost of 1 .02: 1 
for misclassiflcation of rejection as no rejection. The resulting algorithm was then used to classify the 
test samples. The algorithm correctly classified 51 of 66 (77%) no rejection samples in the training set 
and 9 of 16 (56%) no rejection samples in the test set. For rejection samples 64 of 64 (100%) were 
correctly classified in the training set and 12 of 12 were correctly classified in the test set. The 
algorithm used 37 genes. MART ranks genes by order of importance to the model. In order, the 37 
genes were: SEQ IDs: 3058, 3030, 3034, 3069, 3081, 3072, 3041, 3052, 3048, 3045, 3059, 3075, 3024, 
279, 3023, 3053, 3022, 3067, 3020, 3047, 3033, 3068, 3060, 3063, 3028, 3032, 3025, 3046, 3065, 
3080, 3039, 3055, 49, 3080, 3038, 3071. 
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Another MART model was built by excluding samples derived from patients in the first month 
post transplant and from patients with known CMV infection. 20 Grade 0 (rejection) samples and 25 
Grade 1B-4 (no rejection) samples were divided into training (80% of each class) and testing (20% of 
each class) sets. A MART algorithm was then developed on the training set to distinguish rejection 
from no rejection samples using default settings. The resulting atgorithm was then used to classify the 
test samples. The algorithm correctly classified 100% of samples of both classes in the training and 
testing sets. However, this model required 169 genes. The sample analysis was done a second time 
with the only difference being requirement that all decision trees in the algorithm be composed of two 
nodes (single decision, "stump model"). In this case 15/16 no rejection samples were correctly 
identified in the training set and 4/4 no rejection samples were correctly identified in the test set. For 
the rejection samples, 17/19 were correctly identified in the training set and 5/6 were correctly 
classified in the test set. This model required 23 genes. In order of importance, they were: SEQ IDs: 
3042, 2783, 3076, 3029, 3026, 2751, 3036, 3073, 3035, 3050, 3051, 3027, 3074, 3062, 3044, 3077, 
2772, 3049, 3043, 3079, 3070, 3057, 3078. 

Real-time PCR panels / classification models 

PCR primers were developed for top rejection markers and used in real-time PCR assays on 
transplant patient samples as described in examples 12 and 13. This data was used to build PCR gene 
expression panels for diagnosis of rejection. Using MART (example 15) a 10-fold cross validated 
model was created to diagnose rejection using 12 no rejection samples (grade 0) and 10 rejection 
samples (grade 3 A). Default settings were used with the exception of assigning a 1.02:1 cost for 
misclassification of rejection as no rejection and requirement that all decision trees be limited to 2 
nodes ("stump model"). 20 genes were used in the model, including: SEQ IDs:101, 3021, 102, 2781, 
78, 87, 86, 36, 77, 2766, 3018, 80, 3019, 2752, 79, 99, 3016, 2790, 3020, 3056, 88. The 10-fold cross- 
validated sensitivity for rejection was 100% and the specificity was 85%. Some PCR primers for the 
genes are listed in Table 2C and the sequence listing. 

A different analysis of the PCR data was performed using the nearest shrunken centroids 
classifier (Tibshirani et al. 2002; PAM version 1.01, see example 15). A 10-fold cross validated model 
was created to diagnose rejection using 13 no rejection samples (grade 0) and 10 rejection samples 
(grade 3 A). Default settings were used with the exception of using a prior probability setting of (0.5, 
0.5). The algorithm derives algorithms using any number of the genes. A 3-gene model was highly 
accurate with a 10 fold cross-validated sensitivity for rejection of 90%, and a specificity of 85%. 

The 3 genes used in this model were: SEQ IDs 2784, 79, and 2794. Some of the PCR 
primers used are given in Table 2C and the sequence listing. An ROC curve was plotted for the 3-gene 
model and is shown in Figure 13. 

Example 18: Assay sample preparation 

In order to show that XDx's leukocyte-specific markers can be detected in whole blood, we 
collected whole blood RNA using the PAXgene whole blood collection, stabilization, and RNA 
isolation kit (PreAnalytix). Varying amounts of the whole blood RNA were used in the initial RT 
reaction (1, 2, 4, and 8ug), and varying dilutions of the different RT reactions were tested (1:5, 1:10, 



162 



WO 2004/042346 



PCT/US2003/012946 



1 :20, 1 :40, 1 :80, 1 : 1 60). We did real-time PCR assays with primers specific to XDx's markers and 
showed that we can reliably detect these markers in whole blood. 

Total RNA was prepared from 14 mononuclear samples (CPT, BD) paired with 14 whole 
blood samples (PAXgene, PreAnalytix) from transplant recipients. cDNA was prepared from each 
sample using 2ug total RNA as starting material. Resulting cDNA was diluted 1:10 and Sybr green 
real-time PCR assays were performed. 

For real-time PCR assays, Ct values of 15-30 are desired for each gene. If a gene's Ct value is 
much above 30, the result may be variable and non-linear. For PAX sample, target RNA will be more 
dilute than in CPT samples. cDNA dilutions must be appropriate to bring Ct values to less than 30. Ct 
values for the first 5 genes tested in this way are shown in the table below for both whole blood RNA 
(PAX) and mononuclear RNA (CPT). 



Gene 


Ct PAX 


Ct CPT 


CD20 


27.41512 


26.70474 


4761 


28.45656 


26.52635 


3096 


29.09821 


27.83281 


GranzymeB 


31.18779 


30.56954 


IL4 


33.11774 


34.8002 


Actin 


19.17622 


18.32966 


B-GUS 


26.89142 


26.92735 



t 

With one exception, the genes have higher Ct values in whole blood. Using this protocol, all 
genes can be detected with Cts <35. For genes found to have Ct values above 30 in target samples, less 
diluted cDNA may be needed. 

Example 19: Allograft rejection diagnostic gene sequence analysis 

Gene products that are secreted from cells or expressed as surface proteins have special 
diagnostic utility in that an assay may be developed to detect relative quantities of proteins in blood 
plasma or serum. Secreted proteins may also be detectable in urine, which may be a useful sample for 
the detection of rejection in renal allograft recipients. Cell surface markers may be detected using 
antigen specific antibodies in ELISA assays or using flow srting techniques such as FACS. 

Each gene that is found to be differentially regulated in one population of patients has several 
potential applications. It may be a target for new pharmaceuticals, a diagnostic marker for a condition, 
a benchmark for titrating drug delivery and clearance, or used in screening small molecules for new 
therapeutics. Any of these applications may be improved by an understanding of the physiologic 
function and localization of the gene product in vivo and by relating those functions to known diseases 
and disorders. Identifying the basic function of each candidate gene helps identify the signaling or 
metabolic pathways the gene is a part of, leading us to investigate other members of those pathways as 
potential diagnostic markers or targets of interest to drug developers. 
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For each of the markers in table 2, we attempted to identify the basic function and subcellular 
localization of the gene. These results are summarized in Table 9. In addition to initial DNA 
sequencing and processing, sequence analysis, and analysis of novel clones, information was obtained 
from the following public resources: Online Mendelian Inheritance in Man at the NCBI, LocusLink at 
the NCBI, the SWISS-PROT database, and Protein Reviews on the Web. For each marker represented 
by a curated reference mRNA from the RefSeq project, the corresponding reference protein accession 
number is listed. Curated sequences are those that have been manually processed by NCBI staff to 
represent the best estimate of the mRNA sequence as it is transcribed, based on alignments of draft 
DNA sequence, predicted initiation, termination and splice sites, and submissions of EST and full- 
length mRNA sequences from the scientific community. 

These methods were used to derive the data in Table 2E. 
Example 20: Detection of proteins expressed by diagnostic gene sequences 

One of ordinary skill in the art is aware of many possible methods of protein detection. The 
following example illustrates one possible method. 

The designated coding region of the sequence is amplified by PCR with adapter sequences at 
either end for subcloning. An epitope or other affinity "tag" such as a "His-tag" may be added to 
facilitate purification and/or detection of the protein. The amplified sequence is inserted into an 
appropriate expression vector, most typically a shuttle vector which can replicate in either bacteria, 
most typically E. coli, and the organism/cell of choice for expression such as a yeast or mammalian 
cell. Such shuttle vectors typically contain origins of replication for bacteria and an antibiotic 
resistance marker for selection in bacteria, as well as the relevant replication and selection sequences 
for transformation/transfection into the ultimate expression cell type. In addition, the sequence of 
interest is inserted into the vector so that the signals necessary for transcription (a promoter) and 
translation operably linked to the coding region. Said expression could be accomplished in bacteria, 
fungi, or mammalian cells, or by in vitro translation. 

The expression vector would then typically be used to transform bacteria and clones analyzed 
to ensure that the proper sequence had been inserted into the expression vector in the productive 
orientation for expression. Said verified expression vector is then transfected into a host cell and 
transformants selected by a variety of methods including antibiotic resistance or nutritional 
complementation of an auxotrophic marker. Said transformed cells are then grown under conditions 
conducive to expression of the protein of interest, the cells and conditioned media harvested, and the 
protein of interest isolated from the most enriched source, either the cell pellet or media. 

The protein is then be isolated by standard of chromatographic or other methods, including 
immunoaffinity chromatography using the affinity "tag" sequence or other methods, including cell 
fractionation, ion exchange, size exclusion chromatography, or selective precipitation. The isolated 
and purified protein is then be used as an antigen to generate specific antibodies. This is accomplished 
by standard methods including injection into heterologous species with an adjuvant, isolation of 
monoclonal antibodies from mice, or in vitro selection of antibodies from bacteriophage display 
antibody libraries. These antibodies are then used to detect the presence of the indicated protein of 
interest in a complex bodily fluid using standard methods such as ELISA or RIA. 
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Example 21: Detecting changes in the rate of hematopoiesis 

Gene expression profiling of blood cells from cardiac allograft recipients was done using 
microarrays and real-time PCR as described in other examples herein. 

Two of the genes in that were most correlated with cardiac transplant acute rejection with both 
microarrays and PCR were hemoglobin Beta and 2,3 DPGM. These genes are well know to be specific 
markers of erythrocyte lineages. This correlation was found using both purified peripheral mononuclear 
cells and whole blood RNA preparations. 

Analysis of the five genes from the PCR data most strongly correlated with rejection showed 
that their expression levels were extremely highly correlated within each other (R2 > 0.85). 



Gene 


Hs 


Acc 


SEQ ID No 


hemoglobin, beta (HBB) 


Hs. 1553 76 


NM_000518 


86 


2,3-bisphosphoglycerate mutase (BP< 


Hs.198365 


X04327 


87 


cDNA FU20347 


Hs.102669 


AK000354 


94 


602620663F1cDNA 


Hs.34549 


AI123826 


107 


HA 1247 cDNA 


Hs.33757 


AI114652 


91 



This suggested that they were all elevated as part of a single response or process. When the 
microarray data was used to cluster these genes with each other and the other genes on the microarray, 
we found that these five genes clustered reasonably near each and of the other array genes which 
clustered tightly with them, four of the top 40 or so were platelet related genes. In addition, these a 
number of these genes clustered closely with CD34. CD34 is a marker of hematopoietic stem cells and 
is seen in the peripheral blood with increased hematopoisis. 

CD34, platelet RNA and erythrocyte RNA all mark immature or progenitor blood cells and it 
is clear that theses marker of acute rejection are part of a coordinated hematopoietic response. A small 
increase in the rate of production of RBCs and platelets may result in large fold changes in RNA levels. 
Immune activation from acute rejection may lead to increased hamatopoiesis in the bone marrow and 
non-marrow sites. This leads to an increase in many lineages because of the lack of complete 
specificity of the marrow response. Alternatively, increased hematopoiesis may occur in a transplant 
recipient due to an infection (viral or other), allergy or other stimulus to the system. This results in 
production of cells or a critical mass of immune cells that can cause rejection. In this scenario, 
monitoring for markers of immune activation would provide an opportunity for early diagnosis. 
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Table 1 



Disease Classification 


Disease/Patient Group 


Cardiovascular Disease 


Atherosclerosis 




Unstable angina 




Myocardial Infarction 




Restenosis after angioplasty 




Congestive Heart Failure 




Myocarditis 




Endocarditis 




Endothelial Dysfunction 




Cardiomyopathy 




Cardiovascular drug use 


Infectious Disease 


Hepatitis A. B, C, D, E. G 




Malaria 




Tuberculosis 




HIV 




Pneumocystis Carinii 




Giardia 




Toxoplasmosis 




Lyme Disease 




Rocky Mountain Spotted Fever 




Cytomegalovirus 




Epstein Ban* Virus 




Herpes Simplex Virus 




Clostridium Dificile Colitis 




Meningitis (all organisms) 




Pneumonia (all organisms) 




Urinary Tract Infection (all organisms) 




Infectious Diarrhea (all organisms) 




Anti-infectious drug use 


Angiogenesis 


Pathologic angiogenesis 




Physiologic angiogenesis 




Treatment induced angiogenesis 




Pro or anti-angiogenic drug use 


Transplant Rejection 


Heart 




Lung 




Liver 




Pancreas 




Bowel 




Bone Marrow 




Stem Cell 




Graft versus host disease 




Transplant vascutopathy 




Skin 




Cornea 




Islet Cells 




Kidnev 
rxiui icy 




Xenotransplants 




Mechanical Organ 




Immunosupressive drug use 


Hematological Disorders 


Anemia - Iron Deficiency 




Anemia - B12, Folate deficiency 




Anemia - Aplastic 




Anemia - hemolytic 




Anemia - Renal failure 




Anemia - Chronic disease 




Polycythemia rubra vera 




Pernicious anemia 




Idiophic Thrrombocytopenic purpura 




Thrombotic Thrombocytopenic purpura 




Essential thrombocytosis 




Leukemia 




Cytopenias due to immunosupression 




Cytopenias due to Chemotherapy 




Myelodysplasia 
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Table 2A. 



Gene 


Gene Name 


SEQ ID 
SOmer 


HS 


ACC 


SEQ ID 
RNA/cDNA 


HSRRN18S 


18S ribosomal RNA 


1 


NA 


X03205 


333 


ACTB 


Actin, beta 


2 


Hs.288061 


NM 001101 


334 ! 


GUSB 


Glucuronidase, beta 


3 


Hs. 183868 


NM 000181 


335 1 


B2M 


beta 2 microglobulin j 


4 


Hs.75415 


NM 004048 


336 ! 


TSN 


Translin 


5 


Hs.75066 


NM 004622 


337 


CCR7 


1707 


6 


Hs.1652 


NM 001838 


338 


IL1R2 


4685-IL1R 


7 


Hs.25333 


NM 004633 


339 


AIF-1 


Allograft inflammatory factor 1, all 
variants 


8 


Hs.76364 


NMJ)04847 


340 


ALAS2 


ALAS2 


9 


Hs.323383 


NM 000032.1 


341 


APELIN 


APELIN 


10 


Hs.303084 


NM 017413 


342 


CD80 


B7-1,CD80 


11 


Hs.838 


NM 005191 


343 


EPB41 


Band 4.1 


12 


Hs.37427 


NM 004437 


344 j 


CBLB 


c-cbl-B 


13 


Hs.3144 


NM 004351 


345 


CCR5 


CCR5 


14 


Hs.54443 


NM 000579 


346 


MME 


CD10 


15 


Hs.1298 


NM 000902 


347 


KLRC1 


CD 159a 


16 


Hs.74082 


NM 002259 


348 J 


FCGR3A 


CD16 


17 


Hs. 176663 


NM 000569 


349 


FCGR3B 


CD16b 


18 


Hs.372679 


NM 000570 


350 


LAG3 


CD223 


19 


Hs/7401 1 


NM 002286 


351 


PECAM1 


CD31 


20 


Hs.78146 


NM 000442 


352 


CD34 


CD34 


21 


Hs.374990 


NM 001773 


353 


FCGR1A 


CD64 


22 


Hs.77424 


NM 000566 


354 


TFRC 


CD71 = T9, transferrin receptor 


23 


Hs.77356 


NM 003234 


355 


CMA1 


chymase 


24 


Hs. 135626 


NM 001836 


356 ! 


KIT 


c-Kit 


25 


Hs.81665 


NM 000222 


357 


MPL 


c-mpl 


26 


Hs.84171 


NM 005373 


358 


EphB6 


EphB6 


27 


Hs.3796 


NM 004445 


359 


EPOR 


EPO-R 


28 


Hs.127826 


NM 000121.2 


360 


Foxp3 


Foxp3 


29 


Hs.247700 


NM 014009 


361 


GATA1 


GATA1 


30 


Hs.765 


NM 002049 


362 


ITGA2B 


GPIIb 


31 


NM 000419.2 


NM 000419 


363 ! 


GNLY 


granulysin 


32 


Hs.105806 


NM 006433 


364 \ 


GZMA 


GZMA 


33 


Hs.90708 


NM 006144 


365 


HBA 


hemoglobin^ alpha 1 


34 


Hs.398636 


NM 000558.3 


366 


HBZ 


hemoglobin, zeta 


35 


Hs.272003 


NM 005332.2 


367 


HBB 


hemoglobin, beta 


36 


Hs. 155376 


NM 000518.4 


368 


HBD 


hemoglobin, delta 


37 


Hs.36977 


NM 000519.2 


369 


HBE 


hemoglobin, epsilon 1 


38 


Hs. 11 7848 


NM 005330 


370 


HBG 


hemoglobin, gamma A 


39 


Hs.283108 


NM 000559.2 


371 


HBO 


hemoglobin, theta 1 


40 


Hs.247921 


NM 005331 


372 


HLA-DP 


MH/c, class II, DP alpha 1 


41 


Hs. 198253 


NM 033554 


373 


HLA-DO 


MHC, class II, DO alpha 1 


42 


Hs.198253 


NM 002122 


374 


HLA-DRB 


MHC, class II, DR beta 1 


43 


Hs.375570 


NM 002124.1 


375 


ICOS 


ICOS 


44 


Hs.5o247 


iNM uizuy/ 


5 /O 


IL18 


IL18 


45 


Hs.83077 


NM 001562 


377 


IL3 


interleukin 3 (colony-stimulating 
factor, multiple) 


46 


Hs.694 


NM_000588 


378 


ITGA4 


Integrin, alpha 4 (antigen CD49D, 
alpha 4 subunit of VLA-4 receptor) 


47 


Hs.40034 


NMJ)00885 


379 
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Gene Name 


SEQ ID 
SOmer . 


HS 


ACC 


SEQ ID 
RNA/cDNA 


1 1 Vj/vlvl 


integrin, alpha M (complement 
component receptor 3, alpha; also 

Imnwn AC fT>1 lh (n\ 7ft\ 

macropnage anugcn aipnu 
polvoentide) 


48 


Hs. 172631 


NM_000632 


380 


ITGB7 


integrin, beta 7 


49 


Hs.1741 


NM 000889 


381 




t AT^ ^r**A AT/f»nVianr*#*T Kinrliniy 
JL«/\r } 1 / viuuiiiWvi i/iiiU-iJig 

nrntAin /P/FRP^ KlPt» 


50 


Hs.99029 


NM_005194 


382 


vrc m 
INr-iiZ 


rvr-nz. 


51 


Hs.75643 


NM 006163 


383 | 


rDLUl 




52 


Hs. 158297 


NM 005018 


384 




plalCltl lavlUI t pencil iUftJiic ^v^~ yv — \^ 

motif) ligand 4) 


53 


Hs.81564 


NM_002619 


385 


PRKCQ 


protein kinase C ? theta 


54 


Hs.211593 


NM 006257.1 


386 


r>T> a t> czf* 1 


OT5 A 17 notwtwfl 


55 


Hs. 198468 


NM 013261 


387 


RAG1 


recombination activating gene 1 


56 


Hs.73958 


NM 000448 


388 


t> a r^i 




57 


Na 


NM 000536 


389 


CXCL12 


chemokine (C-X-C motif) ligand 12 
(stromal cell-derived factor 1) (SDF- 
1) 


58 


Hs.237356 


NM_000609 


390 


TTVTT7D CT7/1 


lull ID I UvlivPaia latiui ivwv/piwi 

supeirainiiy, niciiiuci *r 


59 


Hs. 129780 


NMJ)03327 


391 




himnr n**f»rr\cic fnf*tf>t* ^HcrjiriHi 

supcriaimiy, iiiciiujci «t ^uia.- 
transcnpuonauy acuvaieu 


60 


Hs. 181097 


NM_003326 


392 


TPS1 


tryptase, alpha 


61 


Hs.334455 


NM 003293 


393 


ADA 




62 


Hs.1217 


NM 000022 


394 






63 


Hs.334873 


NM 001874.1 


395 




rAlnn^; ctitnulilttrK* fVlftfYT C~tN^— 


64 


Hs.1349 


NM 000758.2 


396 


Cor 3 


C+imnliltlMO TCtPttfYT It— l SF 


65 


Hs.2233 


NM 172219 


397 


CRP 


C-reactive protein, pentraxin-related 

(CKr*), 


66 


Hs.76452 


NM_000567.1 


398 


FLT3 


FMS-Related Tyrosine Kinase 3 


67 


Hs.385 


NM 004119 


399 


ATA 1 


(~i A T A V\inHtti<T nT/vfpin ^ 


68 


Hs. 169946 


NM 002051.1 


400 


IL7R 


Interleukin 7 receptor 


69 


Hs.362807 


NM 002185.1 


401 


ivLr 1 


EKLF 


70 


Hs.37860 


NM_006563.1 


402 




1irrrvr\l-tr»o\7+f»_cnpr , tflr* nrntpin tvm^inp 
lyiiijjiiUwYtC— spcuiiiv jjiLiiciii ly luoiiiu 

kinase 


71 


Hs.1765 


NM_005356.2 


403 


LEFl 


lymphoid enhancer-binding factor 1 


72 


Hs.44865 


NMJH6269.2 


404 


T>T AT TO 


1 Trrslrinii CP_t"vmP PlaCTYIItlflOPn 

Activator Receptor, CD87, uPAR 


73 


Hs. 179657 


NM_002659.1 


405 


TNFSF13B 


Tumor necrosis factor (ligand) 
superiarniiy, mcinucr ud, 
B 1 vS/TALL- 1 /B AFF 


74 


Hs.270737 


NM_006573.3 


406 


IL8 


Interleukin 8 


75 


Hs.624 


NM 000584 


407 1 


GZMB 


Granzyme B (granzyme 2, cytotoxic 
T-lymphocyte-associated serine 
esterase 1) 


76 


Hs.1051 


NM_004131 


408 


TNFSF6 


Tumor necrosis factor (ligand) 
superfamily, member 6 


77 


Hs.2007 


NMJ)00639 


409 
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Gene 


Gene Name 


5>n,y III 
SOmer 


Ha 


Apr 

ALL 


DMA /«Tfc M A 

KIN A/CDIN A 


TCIRG1 


T-cell, immune regulator 1, ATPase, 
H+ transporting, lysosomal V0 
protein a isoform 3 


78 


ris.4o4oD 


NM_uUoUly 


il A 

410 


PRF1 


Perforin 1 (pore forming protein) 


79 


Hs.2200 


NM 005041 


411 


IL4 


Interleukin 4 


80 


Hs.73917 


X.TX A i\t\i\ c o f\ 

NM 000589 


412 


IL13 


Interleukin 13 


81 


Hs.845 


NM 002188 


413 


CTLA4 


Cytotoxic T-lymphocyte-associated 
protein 4 


82 


Hs.247824 


NM_005214 


414 


CD8A 


CD8 antigen, alpha polypeptide (p32) 


83 


Hs.85258 


NM_001768 


415 


BY55 


Natural killer cell receptor, 
immunoglobulin superfamily member 


84 


Hs.81743 


NM_007O53 


416 


Off) 4460 


EST 


85 


Hs.205159 


API 50295 


417 


HBB 


Hemoglobin, beta 


86 ! 


Hs.155376 


NM 000518 


418 


BPGM 


2,3-bisphosphoglycerate mutase 


87 


Hs. 1983 65 


NM 001724 


A 1 f\ 

419 


MTHFD2 


Methylene tetrahydrofolate 
dehydrogenase (NAD+ dependent), 
methenyltetrahydrofolate 
cyclohydrolase 


88 


Hs. 154672 


NM_006636 


420 


TAP1 


Transporter 1, ATP-binding cassette, 
sub-family B (MDR1/TAP) 


89 


Hs.352018 


NM_000593 


421 | 


KPNA6 


Karyophenn alpha 6 (lmportin alpha 
7) 


90 


Hs.301553 


AW021037 


/in 
422 


OID 4365 


Mitochondrial solute carrier 


91 


TT O f\ /\ A f \ S" 

Hs.300496 


All 14652 


423 


IGHM 


Immunoglobulin heavy constant mu 


92 


Hs.300697 


BC032249 


424 


OID 573 


KIAA1486 protein 


93 


Hs.210958 


AB040919 


lot 

425 ! 


OID 873 


KIAA1892 protein 


94 


Hs. 102669 


AK000354 


426 ! 


OED 3 


EST 


95 


Hs. 104 157 


AW968823 


427 


CXCR4 


Chemokine (C-X-C motif) receptor 4 


96 


Hs.89414 


NM 003467 


A *S O 

428 


CD69 


CD69 antigen (p60, early T-cell 
activation antigen) 


97 


Hs.82401 


NM_001781 


429 


CCL5 


Chemokine (C-C motif) ligand 5 
fRANTES, SCYA5) 


98 


Hs.241392 


NM_002985 


430 


IL6 


Interleukin 6 


99 


Hs.93913 


NM 000600 


431 


IL2 


Interleukin 2 


100 


Hs.89679 


NM 000586 


432 


KLRF1 


Killer cell lectin-like receptor 
subfamilv F. member 1 


101 


Hs.183125 


NMJ 16523 


433 


LYN 


v-yes-1 Yamaguchi sarcoma viral 
related oncogene homolog 


102 


Hs.80887 


NM_002350 


it O A 

434 


IL2RA 


Interleukin 2 receptor, alpha 


103 


Hs.1724 


NM 000417 


435 


CCL4 


Chemokine (C-C motif) ligand 4, 
SCYA4 


104 


Hs.75703 


NM_002984 


436 


Off) 6207 


EST 


105 


Hs.92440 


D20522 


437 


ChGn 


Chondroitin beta 1,4 N- 
aceWlgalactosaminyltransferase 


106 


Hs. 11260 


NMJ 18371 


438 


OID 4281 


EST 


107 


Hs.34549 


AA053887 


439 


CXCL9 


Chemokine (C-X-C motif) ligand 9 
(MIG) 


108 


Hs.77367 


NMJ)02416 


440 


CXCL10 


Chemokine (C-X-C motif) ligand 10, 
SCYB10 


109 


Hs.2248 


NMJXH565 


441 
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Gene 


nna Noma 

v*ene pi a me 


SEO ID 

JUIllvi 


HS 


ACC 


SEQID 
RNA/cDNA 


IL17 


[nterleukin 17 (cytotoxic T- 
lymphocyte-associated serine esterase 
8) 


i in ! 


Hs.41724 


NM 002190 


442 


IL15 


Interleukin 15 


111 


Hs. 168 132 


NM 000585 


443 


IL10 


interieiLKin iu 




Hs 193717 


NM 000572 


444 


1FNG 




1 1 7 


Hs.856 


NM 000619 


445 


HLA-DRB1 


Major histocompatibility complex, 
class 11, DK beta l 


114 


Hs.308026 


NM_002124 


446 


CD8B1 


LUo antigen, oeta polypeptide i 
(d37) 


1 1 S 


Hs 2299 


NM 004931 


447 


CD4 


CD4 antigen (p55) 


116 


Hs. 17483 


NM 000616 1 


448 1 


CXCR3 


Chemokine (C-X-C motif) receptor 3, 
GPR9 


117 


Hs.198252 


NM_001504 


449 


/NTT\ ~t f\C\ A 

OID 7094 


XDx bol 4/y\jIZ 


1 1 R 
1 1 o 




NA 


450 


OH) 7605 


"COT 


1 10 

117 


Hs 109302 


AA808018 


451 


CXCL1 


Chemokine (C-X-C motif) ligand 1 
(melanoma growth stimulating 
activity, aloha) 


120 


Hs.789 


NM_001511 


452 


OID 253 


EST 


121 


Hs.83086 


AK091125 


453 ! 


GPI 




199 


He 409 1 62 

HO.*t\/7 A \9*m 


NM 000175 


454 


CD47 


L,D4 / antigen ^KJi-reiaiea anngen, 
integrin-associated signal transducer) 


19"? 


Hs 82685 


NM 001777 

1 11*1 w A 9 9m 


455 


HLA-F 


Major histocompatibility complex, 
class I T F 


124 


Hs.377850 


NM_0 18950 


456 


OID 5350 


bol 


1 95 


Hs 4283 


AK055687 


457 


TCRGC2 




1 96 


Hs 11 2259 


M17323 


458 


OLD 7016 


EST 


127 


NA 


BI018696 


459 


PTGS2 


Prostagianain-enaoperoxiae syninase 
2 (prostaglandin G/H synthase and 
cvclooxveenase) 


1 9R 


ns. i 7uj o*r 


NM 000963 

1^1 AVI \J\J\J J \J ~J 


460 


OID 5847 


Hypothetical protein r-LJ-J/y iy 


19Q 


Hs 997224 


NM 144588 


461 


PRDM1 


PR domain containing i , wiin z.iNr 
domain 


1 7fl 


Hs 788746 

lib. JOOJ*tU 


NM 001198 


462 


CKB 


Creatine kinase, Brain 


131 


Hs. 173724 


NM 001823 


463 


TNNI3 


Troponin I, cardiac 


179 


Hs 751 782 


NM 000363 


464 


TNNT2 


Troponin T2, cardiac 


133 


Hs.296865 


NM 000364 


465 


MB 


Myoglobin 


174 


Hs 1 18876 

ltd. 1 1 OOJU 


NM 005368 


466 


SLC7A11 


Solute carrier family 7, (cationic 
amino acid transporter, y+ system) 
member 1 1 


135 


Hs.6682 


NM_0 14331 


467 


TNrRorS 


tumor necrosis factor receptor 
superfamilv, member 5: CD40 


1 76 
1 JO 


Hs 25648 


NM 001250 


468 


TNFRSF7 


tumor necrosis factor receptor 
superfamilv, member 7; CJJz / 




Hs 755707 


NM 001242 


469 


CD86 


CL/oo antigen (tuzo anngen iigana 
2, B7-2 antigen) 


1 78 


Hs 27954 

Jlb.Z. / 7J*t 


NM 175862 


470 


ADFlv2 


Allograft inflammatory factor 1 , 
splice variant 2 


1 7Q 


He 76764 


TMM 004847 

1>( iva u\rtot » 


471 


EBV BCLF-1 


BCLF-1 major capsid 


140 


NA 


AJ507799 


472 ! 


EBV EBV 


EBNA repetitive sequence 


141 


NA 


AJ507799 


473 1 


CMV p67 


pp67 


142 


NA 


XI 7403 


474 


CMV TRL7 


c6843-6595 


143 


NA 


X17403 


475 


CMV IEle3 


IE1 exon3 


144 


NA 


X17403 


476 
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Gene 


Gene Name 


SEQ ID 
50mer 


HS 


ACC 


SEQ ID 
RNA/cDNA 


CMV IEle4 


IE 1 exon 4 (40 variants) 


145 


NA 


XI 7403 


477 


EBV EBNA-1 


EBNA-1 coding region 


146 


NA 


AJ507799 


478 


EBV BZLF-1 


Zebra gene 


147 


NA 


AJ507799 


479 


EBV EBN 


EBNA repetitive sequence 


148 


NA 


AJ507799 


480 


EBV EBNA-LP 


Short EBNA leader peptide exon 


149 


NA 


AJ507799 


481 


CMV IE1 


IE1S 


150 


NA 


X17403 


482 


CMV IE1 


IE1-MC (exon 3) 


151 


NA 


XI 7403 


483 


CLC 


Charot-Leyden crystal protein 


152 


Hs.889 


NM 001828 


484 


TERF2IP 


telomeric repeat binding factor 2, 
interacting protein 


153 


Hs.274428 


NM 018975 


485 


HLA-A 


Major histocompatibility complex, 
class I, A 


154 


Hs. 18 1244 


NM 002116 


486 


OED 5891 


EST 3' end 


155 


None 


AW297949 


487 


MSCP 


mitochondrial solute carrier protein 


156 


Hs.283716 


NM 018579 


488 


DUSP5 


dual specificity phosphatase 5 


157 


Hs.2128 


NM 004419 


489 


PR01853 


Hypothetical protein PRO 1853 


158 


Hs.433466 


NM 018607 


490 


OID 6420 


73 A7, FLJ00290 protein 


159 


Hs.98531 


AK090404 


491 


CDSN 


Corneodesmosin 


160 


Hs.507 


NM 001264 


492 


OID 4269 


EST 


161 


Hs.44628 


BM727677 


493 


RPS25 


Ribosomal protein S25 


162 


Hs.409158 


NM 001028 


494 


GAPD 


Glyceraldehyde-3-phosphate 
dehydrogenase 


163 


Hs. 169476 


NMJ)02046 


495 


RPLP1 


Ribosomal protein, large, PI 


164 


Hs.424299 


NM 001003 


496 


OID_5115 


qz23b07.xl cDNA, 3 f end 
/clone=IMAGE:202770 1 


165 


NA 


AI364926 


497 


SLC9A8 


Solute carrier family 9 
(sodium/hydrogen exchanger), 
isoform 8 


166 


Hs.380978 


AB023156 


498 


OID 1512 


IMAGE:3865861 5 clone 5' 


167 


Hs.381302 


BE6 18004 


499 | 


POLR2D 


Polymerase (RNA) II (DNA directed) 
polypeptide D 


168 


Hs. 194638 


NM_004805 


500 


ARPC3 


Actin related protein 2/3 complex, 
subunit 3, 21kDa 


169 


Hs.293750 


NM_005719 


501 


OID 6282 


EST y end 


170 


Hs.17132 


BC041913 


502 


PRO 1073 


PRO1073 protein 


171 


Hs.356442 


AF001542 


503 


OID_7222 


EST, weakly similar to A43932 
mucin 2 precursor, intestinal ] 


172 


Hs.28310 


BG2 60891 


504 


FPRL1 


Formyl peptide receptor-like 1 


173 


Hs.99855 


NM 001462 


505 


FKBPL 


FK506 binding protein like 


174 


Hs.99134 


NM 022110 


506 


PREB 


Prolactin regulatory element binding 


175 


Hs.279784 


NMJH3388 


507 


OID 1551 


Hypothetical protein LOC200227 


176 


Hs.250824 


BE887646 


508 


OID 7595 


DKFZP566F0546 protein 


177 


Hs. 144505 


NM 015653 


509 | 


RNF19 [ 


Ring finger protein 19 


178 


Hs.48320 


NM 015435 


510 


SMCY 


SMC (mouse) homolog, Y 
chromosome (SMCY) 


179 


Hs.80358 


NM 004653 


511 


OID 4184 


CMV HCMVUL109 


180 


NA 


XI 7403 


512 


OID 7504 


Hypothetical protein FLJ35207 


181 


Hs.86543 


NM 152312 


513 


DNAJC3 


DnaJ (Hsp40) homolog, subfamily C, 
member 3 


182 


Hs.9683 


NM_006260 


514 


ARHU 


Ras homolog gene family, member U 


183 


Hs.20252 


NMJ)21205 


515 


OID 7200 


Hypothetical protein FLJ22059 


184 


Hs. 13323 


NM 022752 


516 
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Gene 


Gene Name 


CFO IT! 

50mer 


ri»3 


ACC 


SEQ ID 
RNA/cDNA 


SERPINB2 


Serine (or cysteine) proteinase 
inhibitor, clade B (ovalbumin), 
member 2 




He 7571 


MM 00257 S 


S17 


ENOl 


Enolase 1 , alpha 


186 


Hs.254105 


NM 001428 


518 


OID 7696 


EST 3 end j 




ris.*oouyz 




S19 ' 


OID 4173 


CMV HCMVTRL2 (IRL2) 


188 


NA 


XI 7403 


520 


CSF2RB 


Upstream variant mRNA of colony 
stimulating factor 2 receptor, beta, 
low-affinity (granulocyte- 
macrophace) 


1 QA 

189 


rls.Zoj^ui 


AT 5AOAQQ 


JZ 1 


OID_7410 


CM2-LT0042-28 1 299-062-e 1 1 
LT0042 cDNA, mRNA sequence 


1 OA 

iyu 


Jtiso / j i*o 


a W»^771 7 


^79 


ODD 4180 


CMV HCMVUS28 


1 A 1 

191 


XT A 
JNA 






ODD 5101 


EST 


1 A"> 

lvz 


rlS.l*»4olH 


DVJ40170 / 


J Art 


MOP3 


MOP-3 


1 07 

193 


He IQftAl A 

ris.joU*ny 


1^1 iVL V/ 1 O 1 O J 


525 


RPL18A 


Ribosomal protein L18a 


1 Avf 

1V4 


Ho 1177AA 


JN1Y1 \J\JsjyO\J 




INPP5A 


Inositol polyphosphates- 
phosphatase, 40kDa 


1 AC 

195 


Up 10/IA7Q 




S27 


hIAN7 


Immune associated nucleotide 


1 A/C 

19o 


t_j_ n>i<:i< 
US. 1Z**0/D 




528 


RPS29 


Ribosomal protein S29 


1 A*7 

197 


rlS.DJy 


INlYl uv 1 \JO^ 


529 


OID 6008 


EST 3 end 


i no 


Ho KIIIL 
riS.j DZ 3ZJ 




530 


OID 4186 


CMV HCMVUL122 


1 oo 

ivy 


XT A 


VI 74YV* 


531 


VNN2 


vanin 2 


200 


Hs.121102 


NM 004665 


532 


OID 7703 


KIAA0907 protein 


ZU1 


riS.Z40j0 


TSTNyf 014Q40 

1N1V1 U14747 


533 


OID 7057 


480F8 


OAT 
ZUZ 


"M A 


*t ouro 


534 


OID 4291 


EST 


203 


Hs.355841 


BC038439 


535 


OID 1366 


EST 


ZU4 


Ho 

ris. lODOyj 


f\ YV 0«3Utf*T 1 


536 


EEF1A1 


Eukaryotic translation elongation 
factor 1 alpha 1 


205 


Hs.422118 


NM_001402 


537 


PA2G4 


Proliferation-associated 2G4, 38kDa 


1A£ 

ZUo 


Up 17AAQ1 

ris.j /44y i 


in ivi uuo iyi 


538 

JJO 


GAPD 


Glyceraldehyde-3-pnosphate 
dehydrogenase 


zu/ 


Up 1 <QA~7A 

rlS. lOV't /O 


IN 1V1 Uv/Z, v/*T o 


539 


CHD4 


Chromodomaui hehcase DNA 
binding protein 4 


1AQ 

ZUo 


Up HA AAA 
US. 1 


IN IVI UU 1Z / J 


540 


OID 7951 


E2F-like protein (LOC51270) 


T AO 

ZU9 


Ho 


xj\yf 

rNlYl UiuJ^i 


541 


DAB1 


Disabled homolog I (Drosophila) 


210 


Hs.344127 


NM 021080 


542 


OID 3406 


Hypothetical protein FLJ20356 


211 


Up /CI A^l 


IN IY1 U10700 


543 

«Jt«7 


OID 6986 


462H9 EST 


212 


Hs.434526 


AK093608 


544 


OID 5962 


EST y end 


213 


Hs.372917 


AW452467 


545 


OID 5152 


EST 3' end 


Zl4 


Up 1 

ris.jooyzi 


AO7ZOV/J 


546 


S100A8 


SI 00 calcium-binding protein A8 
(calgranulin A) 


215 


Hs.4 16073 


NM_002964 


547 


HNRPU 


HNRPU Heterogeneous nuclear 
ribonucleoprotein U (scaffold 
attachment factor A) 


216 


Hs.103804 


BM467823 


548 


ERCC5 


Excision repair cross-complementing 
rodent repair deficiency, 
complementation group 5 (xeroderma 
pigmentosum, compiemeniauon 
group G (Cockayne syndrome)) 


Zl / 


Up >iQ<'7i£ 

ris.*ioD/o 


XTNyf 00017^ 




RPS27 


Ribosomal protein S27 
(metallopanstimulin 1) 


218 


Hs. 195453 


NM_001030 


550 


ACRC 


acidic repeat containing (ACRC), 


219 


Hs. 135 167 


NM 052957 


551 
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Gene 


Gene Name 


ceo ID 
SOmer 


HS 


ACC 


SEO ID 
RNA/cDNA 


PSMD11 


Proteasome (prosome, macropain) 
26S subunit, non-ATPase, 1 1 


99ft 


u c OH744 




DJZ. 


OID 1016 


FLJ00048 protein 


221 


Hs.289034 


AK024456 


553 


OID 1309 


AV706481 cDNA 


999 
zzz 


None 


/A. V /UOHO 1 




OIDJ7582 


Weakly similar to ZINC FINGER 
PROTEIN 142 


191 

Z23 


HS- 104yj 


/VNUZ / ODD 


jjj 


OID_4317 


ta73c09.xl 3 end 

/clone=IMAGE:2049712 Ribosomal 
Protein SI 5 


224 


tt_ loll 70 

MS.35 / 1 /y 


A 111 fildO 
AlJ 1 


jjO 


OID 5889 


3' end /clone=IMAGE:3083913 


225 


riS.2j50yo 


A W907JM1 
A WZy /oHj 


^^7 


UBL1 


Ubiquitin-like 1 (sentrin) 


226 


Hs.81424 


NM 003352 


558 J 


OID 3687 


EST 


221 


None 


W03955 


559 1 


OID 7371 


EST 5' 


HQ 
228 


ris.2yuo/4 


DC /3UDU3 


SAO 


SH3BGRL3 


SH3 domain binding glutamic acid- 
rich protein like 3 


229 


Up t none, i 


INIVL_U3 1 ZoO 




SEMA7A 


Sema domain, immunoglobulin 
domain (Ig), and GPI membrane 
anchor, (semaphorin) 7A 


230 


t_t_ i/i/C/in 
rlS.24o4U 


IN IVl UUJ 0 1 2 


SA9 


OED 5708 


EST 3' end 


Til 

2J 1 


XlS.Z404y4 


AWftRI ^4fl 
A W V/o 1 ->**U 


JUJ 


OED 5992 


EST 3 end 


232 


ris.Z3 / /uy 




Sfi4 


IL21 


Interleukin 21 


233 


Hs.302014 


NM 021803 


565 


HERC3 


Hect domain and RLD 3 (HERC3) 


234 


Hs.35804 


NM 014606 


566 


OID 7799 


AluJo/FLAM SINE/Alu 


2,53 




A WR1771 7 


^67 


Pll 


26 serine protease 


23o 


ris.yy / J 


INlYl UUOUZJ 


_7 VJO 


OID 7766 


EST 3 end 


23 / 


Up /11*701 1 

iiS.43 /yj 1 


A W70J.7 1 1 
j\ w ^y*t /ii 




TIMM10 


translocase of inner mitochondrial 
membrane 10 (yeast) homolog 
(TIMM10) 


Z3o 


Me 91^75ft 
XlS. Z3D /3U 


My A17A^fi 


S70 


EGLN1 


Egl nine homolog 1 (C. elegans) 


239 


Hs.6523 


AJ3 10543 


571 


TBCC 


Tubulin-speciflc chaperone c 


1/1 fl 

24U 


OS. / DU04 


INiVl UUJ l^Z 




RNF3 


Ring finger protein 3 


i/i i 
241 


rlS.oo34 


IN IVl UUDji J 


S7^ 


OID_6451 


170F9, hypothetical protein 
FLJ21439 


242 


Hs.288872 


AL834168 


574 


CCNDBP1 


cyclin D-type binding-protein 1 
(CCNDBP1) 


1/11 

243 


rlS.30/y4 


IN IVl 


S7S 

t 


OID 8063 


MUC18 gene exons 1&2 


1/1/1 

244 


XT A 

IMA 


AOoZO*f 


S7fi 

J / D 


SUV39H1 


Suppressor of variegation 3-9 
homolog 1 (Drosophila) 


245 


tj_ 

MS.3 /y3o 


IN IVl UUj I / j 


^ / / 


HSPC048 


HSPC048 protein 


24o 


Tip 1HQQAA 

HS.2 /oy44 


IN IVl UI*fl*lo 


^ /o 


ODD 5625 


EST 3' end from T cells 


247 


Hs.279121 


AW06378O 


579 ! 


WARS 


Tryptophanyl-tRNA synthetase 


24 o 


it- ©mm 


XTAyf AflA 1 R4 

iNJVl UU*Ho*r 


SRft ! 

JOV J 


OID 6823 


107H8 


1/lft 

249 


it. i ico/c i n 

Ms.ioyoiu 


AT C17/iil7 


JO 1 


OID 7073 


119F12 


250 


Hs. 13264 


AL705961 


582 


OID 5339 


EST y end 


251 


Hs.436022 


AI625119 


583 J 


OID_4263 


fetal retina 937202 cDNA clone 
IMAGE:565899 


1 CI 

252 


ris. /Uo / / 


A A 1 1ft^RA 
AAI30jo*f 




MGC26766 


Hypothetical protein MGC26766 


1 CI 

253 


t t _ 1001 C£L 

Hs.2ool5o 


AJvU2 j4 /Z 


CQC 
JOJ 


SERPINB11 


Serine (or cysteine) proteinase 
lnniuiior, ciauc o ^uvaiuunmi/, 
member 1 1 


1 C /I 

254 


Hs.35Uy5o 


XTAyf A0AA7^ 

INM^_UoU*f / D 




OID 6711 


58G4, IMAGE:4359351 5' 


255 


none 


BF968628 


587 


RNF10 


Ring fmger protein 10 


256 


Hs.5094 


NM 014868 


588 


MKRN1 


Makorin, ring finger protein, 1 


257 


Hs.7838 


NM 013446 


589 


RPS16 


ribosomal protein SI 6 


258 


Hs.397609 


NM 001020 


590 
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Gene 


Gene Name 


SEQ ID 
SOmer 


HS 


ACC 


SEQ ID 
RNA/cDNA 


DA 71 A 


RrnmnHnmain adjacent to 7inc finffCr 

/IrvrrtfUTi 1 A 


259 


Hs.8858 


NM_0 13448 


591 


OID 5998 


EST 3* end 


260 


Hs.330268 


AW468459 


592 


ATP^T 1 


ATP cvntha^e H+ transnortinc 
mitochondrial FO complex, subunit g 


261 


Hs. 107476 


NM_006476 


593 


KJlAJ Oj7J 




262 1 


NA 


52B9 


594 


KOAaiN 


T rKi'm lttmic tf*tm triprvnpntiHp 

UOIUUILUUO ICU nil 1V/U|K|/L1U^ 


263 


Hs.25347 


BC004857 


595 




"Wiiflpar pan Vnnrlincy nrntein subunit 

2, 20kDa 


264 


Hs.240770 


NM_007362 


596 


OID 6273 


EST 3' end 


265 


Hs. 158976 


AW294774 


597 


W7T71 O 


T\r\f finopr nrnfpi r> 1 7 


266 


Hs.164284 


NM 033204 


598 




f^Ii#»mrklrinp (C-C mntiH 1it*and 3 


267 


Hs.73817 


D90144 


599 


OID 4323 


IMAGE: 1283731 3' 


268 


Hs.370770 


AA744774 


600 




toQ7M9v1 MPT CHAP CLL1 

oi-milor- fr\ /«nntainc TAR 1 tl \/fPP77 


269 


NA 


AI400725 


601 


PRDX4 


Peroxiredoxin 4 


270 


Hs.83383 


NM 006406 


602 


ts l is. . 


Rmtrm ao^mmaolnVrtili'neTTlia TVTOSinc 

lfin40A 


271 


Hs. 159494 


NM_000061 


603 


KJLU 0Z7O 


Tmnnrtin fipta QiiKllflit mRNA 


272 


Hs. 180446 


AI948513 


604 


PGK1 


Phosphoglycerate kinase 1 


273 


Hs.78771 


NM 000291 


605 




Tumor rtprrn^is factor reCCDtor 

cimprTotniK; m^mViPf 10a 


274 


Hs.249190 


NM_003844 


606 


ADM 


adrenomedullin j 


275 


Hs.394 


NM 001124 


607 






276 


NA 


138G5 


608 


v^zuono 


Af\"\ A A rVimmn«:nmp 90 nnpn readinff 
rramc o 


277 


Hs.88820 


NM_0 16649 


609 


vJlU jZZO 




278 


Hs. 105460 


NM 015393 


610 


ASAH1 


N-acylsphingosine amidohydrolase 
(acid ceramidase) 1 


279 


Hs.75811 


NMJH>4315 


611 


ATT? ^ 


A nti vatin© tranQcritttion factor 5 


280 


Hs.9754 


NM 012068 


612 


KjlD HOO/ 


'hvnntfiptipat nrntfin MfiC14176 


281 


Hs.417157 


NM 032895 


613 


OID 4239 


EST 


282 


Hs. 177376 


BO022840 


614 


MDM2 


Mouse double minute 2, homolog of; 
pj j-oinciiiig piuicm ^iYLL/j.vi^fcy, 
transcripi vandni iyll/ivia, 


283 


Hs. 170027 


NMJ)02392 


615 






284 


Hs.268555 


AF064257 


616 


OID_6039 


Endothelial differentiation, 
lysophosphatidic acid G-protein- 


285 


Hs.122575 


BE502246 


617 


run ao i n 


TMAnF-4S4000fi 


286 


Hs.374836 


AI300700 


618 




XLO 1 j CI1U 


287 


Hs. 118899 


AA243283 


619 


rl\JSJs-A 


riOiein KlJlabc, UllCIlCIUli-lIlUUWlUlw 

double stranded RNA dependent 
activator 


288 


Hs. 18571 


NM_003690 


620 


OID 4288 


IMAGE:2091815 


289 


Hs.309108 


AI378046 


621 


OID 5620 


EST 3' end from T cells 


290 


Hs.279116 


AW063678 


622 


OID 7384 


EST 5' 


291 


Hs.445429 


BF475239 


623 


OID_1209 


EST Weakly similar to hypothetical 
protein FLJ20378 


292 


Hs.439346 


C14379 


624 


CDKN1B 


Cyclin-dependent kinase inhibitor IB 
(p27, Kipl) 


293 


Hs.238990 


NM_004064 


625 
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Gene 


Gene Name 


SEQID 
SOmer 


HS 


ACC 


SEQ ID 
RNA/cDNA 


PLOD 


Procollagen-lysine, 2-oxoglutarate 5- 
dioxygenase (lysine hydroxylase, 
Ehlers-Danlos syndrome type VI) 


294 


Hs.75093 


NM_000302 


626 


OID 5128 


EST 


295 


Hs.283438 


AK097845 


627 


OID 5877 


EST 3* end 


296 


Hs.438118 


AW297664 


628 


FZD4 


Frizzled (Drosophila) homolog 4 | 


297 


Hs. 19545 


NM 012193 


629 


HLA-B 


Major histocompatibility complex, 
class I, B 


298 


Hs.77961 


NM_005514 


630 


OID 5624 


EST y end from T cells 


299 


Hs.279120 


AW063921 


631 


FPR1 


Formyl peptide receptor 1 


300 


Hs.753 


NM 002029 


632 


ODF2 


Outer dense fiber of sperm tails 2 


301 


Hs. 129055 


NM 153437 


633 


OID_5150 


tg04g01.xl cDNA, 3' end 
/clone=IMAGE:2 1 07824 


302 


Hs. 160981 


AI392793 


634 


OID 5639 


EST 3* end from T cells 


303 


Hs.279139 


AW064243 


635 


OID 6619 


469A10 


304 


NA 


469A10 


636 


OID 6933 


463C7, 4 EST hits. Aligned 


305 


Hs.86650 


AI089520 


637 


OID 7049 


480E2 


306 


NA 


480E2 


638 


IL17C 


Interleukin 17C 


307 


Hs.278911 


NM 013278 


639 


OID 5866 


EST 3' end 


308 


Hs.255649 


BM684739 


640 


CD44 


CD44 


309 


Hs.169610 


AA9 16990 


641 


VPS45A 


Vacuolar protein sorting 45A (yeast) 


310 


Hs.6650 


NM_007259 


642 


OID_4932 


aa92c03.rl Stratagene fetal retina 
937202 cDNA clone IMAGE:838756 


311 


NA 


AA457757 


643 


OID 7821 


EST 


312 


NA 


AA743221 


644 


OID_4916 


zr76a03.rl Soares JMhHMPu_S 1 
cDNA clone IMAGE:669292 


313 


NA 


AA252909 


645 


OID 4891 


Hypothetical protein LOC255488 


314 


Hs.294092 


AL832329 


646 ! 


HADHB 


Hydroxyacyl-Coenzyme A 
dehydrogenase/3-ketoacyl-Coenzyme 
A thiolase/enoyl-Coenzyme A 
hydratase (tri functional protein), beta 
subunit 


315 


Hs.146812 


NM_000183 


647 j 


FLJ22757 


Hypothetical protein FLJ22757 


316 


Hs.236449 


NM 024898 


648 


RAC1 


Ras-related C3 botulinum toxin 
substrate 1 (rho family, small GTP 
binding protein Racl) 


317 


Hs.173737 


AK054993 


649 


OID 6415 


72D4, FLJ00290 protein 


318 


Hs.98531 


CA407201 


650 


NMES1 


Normal mucosa of esophagus specific 
1 


319 


Hs. 112242 


NM_032413 


651 


DMBT1 


Deleted in malignant brain tumors 1, 
transcript variant 2 


320 


Hs.279611 


NMJ)07329 


652 


RPS23 


ribosomal protein S23 


321 


Hs.3463 


NM 001025 


653 


ZF 


HCF-binding transcription factor 
Zhangfei 


322 


Hs.29417 


NM_021212 


654 


NFE2L3 


Nuclear factor (erythroid-derived 2)- 
like 3 


323 


Hs.22900 


NM_004289 


655 


RAD9 


RAD9 homolog (S. pombe) 


324 


Hs.240457 


NM 004584 


656 


OID 6295 


EST 3' end 


325 


Hs.389327 


AI880607 


657 


DEFCAP 


Death effector filament-forming Ced- 
4-like apoptosis protein, transcript 
variant B 


326 


Hs.104305 


NM_014922 


658 


RPL27A 


Ribosomal protein L27a 


327 


Hs.76064 


BF214146 


659 


IL22 


Interleukin 22 (IL22) 


328 


Hs.287369 


NM 020525 


660 
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Gene 


Gene Name 


SOmer 


HS 


ACC 


RNA/cDNA 


DCTVVf A A 


PrntpQcnmp ( nrnQflmp msi^mriain^ 

subunit, alpha type, 4, (PSMA4) 


329 


Hs 251531 


NM 002789 


661 


CCNI j 


cvclin I (CCNI) 


330 


Hs.79933 


NM 006835 


662 


THBD 


Thrombomodulin 


331 


Hs.2030 


NM 000361 


663 


CGR19 


Cell growth regulatory with ring 
finger domain 


332 


Hs.59106 


NMJ)06568 


664 
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SEQ ID 
SOmer 


Gene 


Gene Name 


ACC 


SEQ ID 
RNA/cDNA 


Non- 
Para 
Score 


Median 
Rank in 
NR 


Down 
Regulated 




CLC 


Charcot-Levden crvstal orotein 


NM 001828 


484 


779 


4342 




153 ! 


TERF2IP 


telomeric repeat binding factor 2, 

interactinp nrotein 


NM_0 18975 


485 


744 


1775 






HI A-A 


Major histocompatibility 
comnlex class T A 


NM_002116 


486 


735 


125 


1 




OTD 5891 


EST 3' end 


AW297949 


487 


730 | 


7044.5 | 


1 


1 SA 




mitochondrial solute carrier 


NMJ) 18579 


488 


730 


3465.5 




1 ^7 


i/uor 


dual snecificitv ohosohatase 5 


NM 004419 


489 


726 


3122.5 




158 


PR01853 


Hypothetical protein PRO 1853 


NM 018607 


490 


725 


4153 




1 ^0 

i J7 j 


orn Ad9o 


73 A7 FLJ00290 orotein 1 


AK090404 


491 


725 


7000.5 




1 Aft 




r^orn f»riH <» cttioqi n 


NM 001264 


492 


722 


2732 




161 


OID 4269 


EST 


BM727677 


493 


715 


5598.5 ! 




1 a? 

iOZ 




Pihn^nmal nrotein S25 


NM 001028 


494 


710 


164.5 




163 


GAPD 


Glyceraldehyde-3-phosphate 
dehydrogenase 


NMJ)02046 


495 


707 


215.5 




1 f\A 
1 0*4 


PPT PI 


Rihnsomal nrotein 1art?e PI 


NM 001003 


496 


703 


157 




1 AS 




az23b07 xl cDNA 3' end 
/clone=IMAGE:2027701 


AI364926 


497 


703 


6629 


1 


1 


cr C0A8 


Solute carrier familv 9 
(sodium/hydrogen exchanger), 

icr*TV\rm R 

loUlUlili O i 


AB023156 


498 


702 


2538.5 




1 A7 


orn 1 s i i 


IMAGE-3 865861 5 clone 5* 


BE6 18004 


499 


700 


4008 


1 


1 AR 
1 uo 


POT R2D 


Polymerase CRNA^ II (DNA 
directed) polypeptide D 


NM_004805 


500 


700 


4190.5 




1 AO 




Actin related orotein 2/3 

X V Villi 1 VIOlvU Ul^lVUl — ' 

rrvrrml^v cnKimit ^ 21kDa 


NM_005719 


501 


698 


470.5 




1 7ft 


orn 6282 


EST 3* end 


BC041913 


502 


697 


4371.5 




171 


PRO 1073 


PRO 1073 protein 


AF001542 


503 


697 


6754 




1 7? 


OTD 1111 


FST weaklv similar to A43932 
mucin 2 precursor, intestinal 


BG260891 


504 


695 


6759 




I / D 


FPRT 1 


Formvl nentide receotor-like 1 


NM 001462 


505 


692 


4084.5 




1 7d 


FKBPL 


FK506 binding orotein like 


NM 022110 


506 


691 


1780.5 




17S 


PPF"R 


Prolactin repnlatorv element 
binding 


NM_013388 


507 


690 


3568 




1 7 A 
1 /O 


orn i s s i 


Hvnothetical nrotein LOC200227 


BE887646 


508 


689 


6423 


1 


1 77 


OTD 7505 


DKFZP566F0546 protein 


NM 015653 


509 


689 


3882.5 




178 


RNF19 


Ring finger protein 19 


NM 015435 


510 


689 


7700.5 




170 


SMCY 


SMC (mouse) homolog, Y 
chromosome (SMCY) 


NM_004653 


511 


687 


6074.5 




1 Rft 


OTD 4184 


CM V HCM VUL 1 09 


XI 7403 


512 


687 


6810.5 




101 


OTD 7504 


Hvnothetical orotein FLJ35207 


NM 152312 


513 


686 


6939 




182 


DNAJC3 


DnaJ (Hsp40) homolog, 
subfamily C, member 3 


NM_006260 


514 


686 


3932.5 




1 oj 


APT-TT I 


Pas homolop pene familv 

mpmKpr T T 


NM 021205 


515 


686 


7584 




184 


OID 7200 


Hypothetical protein FLJ22059 


NM 022752 


516 


685 


2804.5 




185 


SERPINB2 


Serine (or cysteine) proteinase 
inhibitor, clade B (ovalbujriin), 
member 2 


NM_002575 


517 


684 


4690.5 




186 


ENOl 


Enolase 1, alpha 


NM 001428 


518 


684 


327 




187 


OID 7696 


EST 3' end 


AW297325 


519 


683 


4875.5 
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SEQID 


Gene 


Gene Name 


ACC 


SEQID 
RNA/cDNA 


Non- 
Para 
Score 


Median 
Rank in 
NR 


Down 
Regulated 


188 


OID 4173 


CMV HCMVTRL2 (IRL2) 


X17403 


520 


683 


4010.5 




189 


CSF2RB 


Upstream variant mRNA of 
colony siiinuidiing iduiui z 
receptor, oeid, low-diiinity 
( granule c vte-ma c rooha ee) 


AL540399 


521 


683 


3753 




190 


<^\Tp» ~ia t a 

UHj_/41U 


r'TVyfO T TAA47 7R17QQ_ftfi7-p1 1 
LI vU^Z cuiN/v, riLtviN/\ sequence 


AW837717 

A\ VV OJ tilt 




682 


7445 




I9l 


rxm a 1 on 


f^A/lA/ WPMVT ICO C 


XI 7403 


523 


681 


4359 




192 


ftjn ciAi 


Co X 


RG461987 


524 


681 


7272 




193 


MUrJ 




MM 01S1 8^ 

1/NIV1 UlOl O J 


jij 


681 

WO 1 


4085.5 


1 


194 


t> T>T 1 O A 


KiDosomai proiem Liisa j 


mm onnoKO 


S7fi 


680 ' 


238 




1 ftC 

195 


TVTDT>^ A 

iJNrr 


inositol porypnospnaieo- 


MM 005539 


527 


680 


4838.5 


1 


196 


nlAIN / 


immune associaiea nucieouue 


RH777fi61 

DVJ / f XUU 1 




680 


4718 




1 ft "7 

197 






mm nnio^i 


529 


680 


107.5 




1 ft o 

198 


/"\TT"\ ZTftft Q 

U1U OUUo 


col j ena 


r\ VV J7^-0 / VJ 


510 
j j \j 


679 


6560.5 




i no 
199 


\JLL) **1o0 


PTV/fV WPTV/IVT IT 1 9? 
v^ivi v nv^ivi v vjl, i zz 


vi 740^ 


531 


677 


4788.5 




200 


VNN2 


vanin 2 


NM 004665 


532 


677 


2620.5 




2U1 


r\TT\ TICl'X 


FT A A A0A/7 nrntpi'n 

r>JJ\r\\jy\J / proiem i 


MM 014949 


533 
■j j .j 


676 


6104.5 




202 


OID 7057 


480F8 


480F8 


534 


675 


6862 




2U3 


KJlXJ 4ZV 1 


T7QT 
CO 1 




535 


674 


5618.5 




204 


OID 1366 


EST 


AW850041 


536 


674 


5590.5 


1 


ZIO 


tier I Al 


cuKaryoiic uanbidiion ciungaiiuu 
factor 1 alpha 1 


MM 001402 

l^HVi \J\J 1 "TV/ A. 


537 


672 


232 




2UO 


r> a ^f~lA 


jrTOllieianon-associaicu zvj**, 
ioKlJa 


MM OOfilQI 


S3R 


672 


4402 




207 


GAPD 


Glyceraldehyde-3-phosphate 


NM_002046 


539 


671 


194.5 




20o 


r*UT\A 

CrtU4 


unromoaomain nencase uis/\ 
binding protein 4 


MM 00,177'* 

1N1V1_00 1 Z / J 


540 


671 


2578.5 




zuy 


KJILJ /50 1 


P7F li1r*» r*rr»ti»in H OPS 1 J1(Y\ 


MM 016*571 

1^ 1V1 \J 1 V/-7 ^ 1 


541 


671 


4467 




210 


UArJ 1 


DisaDiea nomoiog i v i^rosopnnaj 


MM 071080 


542 


670 


6357.5 




211 


U1LJ J4U0 


UimntViafi^ol nrAtoin T7T T7 A^i ^ A 


MM O1A0RA 


S43 


669 


2087 




212 


OID 6986 


462H9 EST 


AK093608 


544 


669 


4454 


1 


213 


OID 5962 


EST 3' end 


AW452467 


545 


668 


5870.5 


1 


214 


<J1U J>1DZ 


co i 3 ena 


ftij7^oyj 


546 


668 


6354.5 




215 


S100A8 


SI 00 calcium-binding protein A8 
(calgranulin A) 


NM_002964 


547 


668 


134 




216 


HNRPU 


HNRPU Heterogeneous nuclear 
ribonucleoprotein U (scaffold 
attachment factor A) 


BM467823 


548 


668 


4108 




217 


ERCC5 


Excision repair cross- 
complementing rodent repair 
deficiency, complementation 

DrAiin S fxprnderma 
givJUjj j ^Aviuuviiiia 

pigmentosum, complementation 
group G (Cockayne syndrome)) 


NM_000123 


549 


668 


6430.5 




218 


RPS27 


Ribosomal protein S27 
fmetallopanstimulin 1) 


NM_001030 


550 


668 


160 
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219 



220 



ACRC 



acidic repeat containing (ACRC), 



NM 052957 551 



668 4871.5 



PSMD11 



Proteasome (prosome, 
macropain) 26S subunit, non- 
ATPase, 11 



AI684022 552 



1 



668 4138 



221 



222 



223 



OID 1016 



OID 1309 



FLJ00048 protein 



AK024456 553 



AV706481 cDNA 



1667 [5199 



OID 7582 



AV706481 |554 



Weakly similar to ZINC 
FINGER PROTEIN 142 



1667 17279.5 



AK027866 555 



667 5003.5 



224 



225 



226 



227 



228 



229 



230 



OID 4317 



OID 5889 



ta73c09.xl 3' end 
/clone=IMAGE:20497 1 2 
Ribosomal Protein SI 5 



AI3 18342 556 



667 6499 



3* end /clone=IMAGE:3083913 



UBL1 



OID 3687 



Ubiquitin-like 1 (sentrin) 



AW297843 557 



1666 6837 



EST 



NM 003352 1558 



1666 11978.5 



OID 7371 



W03955 



EST 5* 



1559 



1666 15519.5 



SH3BGRL3 



SEMA7A 



SH3 domain binding glutamic 
acid-rich protein like 3 



BE730505 1560 



|665 7751.5 



NM_031286 561 



665 310 



Sema domain, immunoglobulin 
domain (Ig), and GPI membrane 
anchor, (semaphoring 7A 



NM 003612 562 



665 3505.5 



231 



OID 5708 



EST 3' end 



AW081540 1563 



1665 16224.5 



232 



OID 5992 



EST 3' end 



AW467992 564 



1665 5648 



233 



IL21 



Interleukin21 



NM 021803 565 



|664 15036.5 



234 



HERC3 



Hect domain and RLD 3 
(HERC3) 



NM_014606 566 



664 3056.5 



235 



OID 7799 



AluJo/FLAM SINE/Alu 



AW837717 567 



1664 3544 



236 



Pll 



26 serine protease 



NM 006025 568 



1664 7173 



237 
238 



OID 7766 
TIMM10 



EST 3' end 



AW294711 1569 



1663 7270.5 



translocase of inner 
mitochondrial membrane 10 
yeast) homolog (TIMMIO^ 



NM 012456 570 



663 4779.5 



239 



EGLN1 



Egl nine homolog 1 (C. elegans) 



AJ310543 571 



1662 17172.5 



240 



241 



242 



TBCC 



RNF3 



Tubulin-specific chaperone c 



NM 003192 572 



OID 6451 



Ring finger protein 3 



1662 3384 



170F9, hypothetical protein 
FLJ21439 



NM 006315 1573 



1661 14062 



AL834168 574 



661 7126 



243 



244 



245 



246 



247 



248 



CCNDBP1 



OID 8063 



cyclin D-type binding-protein 1 
(CCNDBPn 



NM 012142 575 



661 1919 



SUV39H1 



HSPC048 



OID 5625 



WARS 



MUC18gene exons 1&2 



Suppressor of variegation 3-9 
homolog 1 (Drosophila) 



X68264 



576 



1661 14692.5 



NM 003173 577 



661 5103 



HSPC048 protein 



EST 3' end from T cells 



NM 014148 578 



1660 5981.5 



Tryptophanyl-tRNA synthetase 



AW063780 579 



1660 4437 



NM 004184 580 



660 905.5 



249 



250 



OID 6823 



107H8 



OID 7073 



119F12 



AL832642 581 



1659 2619 



AL705961 582 



1659 16837.5 



251 



OID 5339 



EST 3' end 



AI625119 583 



1658 14414.5 1 



252 



OID 4263 



fetal retina 937202 cDNA clone 
IMAGE:565899 



AA136584 584 



658 5870 



253 



MGC26766 



Hypothetical protein MGC26766 



AK025472 585 



658 1892.5 
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SERPINB I 
l 


Serine (or cysteine) proteinase 
inhibitor, clade B (ovalbumin), 
member 1 1 


NM_080475 


586 


658 


7535.5 


1 


255 


OID 67ll 


58G4, IMAGE:435935l 5* 


BF968628 


587 


658 


7264 




256 


RNF10 


Ring finger protein 1 0 


NM. 014868 


588 


658 


3127.5 




257 


MKRNl 


Makorin, ring finger protein, l 


NM 013446 


589 


658 


2228.5 




258 


RPS16 


ribosomal protein SI 6 


NM 001020 


590 


657 


165.5 




259 


BAZIA 


Bromodomain adjacent to zinc 
finger domain, lA 


NM_0 13448 


591 


657 


2533 




260 


OID 5998 


EST 3' end 


AW468459 


592 


657 


6339.5 




261 


ATP5L 


ATP synthase, H+ transporting, 
mitochondrial F0 complex, 
subunit g 


NM_006476 


593 


657 


1155 




262 


OID 6393 


52B9 


52B9 


594 


657 


7420.5 




263 


RoXaN 


Ubiquitous te tra trie op ep tide 
containing protein RoXaN 


BC004857 


595 


656 


7378 




264 


NCBP2 


Nuclear cap binding protein 
subunit 2, 20kDa 


NMJ)07362 


596 


656 


4666.5 




265 


OID 6273 


EST 3' end 


AW294774 


597 


656 


5498.5 




266 


HZF12 


zinc finger protein 12 


NM 033204 


598 


656 


4715.5 




267 


CCL3 


Chemokine (C-C motif) ligand 3 


D90144 


599 


656 


4910 


1 


268 


OID 4323 


IMAGE: 1283731 3' 


AA744774 


600 


655 


6406.5 


1 


269 


OID5181 


tg93hl2.xl NCI_CGAP_CLL1 
cDNA clone IMAGE:21 16391 3* 
similar to contains TARl.tl 
MER22 


AI400725 


601 


655 


4838 


1 


270 


PRDX4 


Peroxiredoxin 4 


NM 006406 


602 


655 


3397.5 




271 


BTK 


Bruton agammaglobulinemia 
tyrosine kinase 


NM_000061 


603 


655 


2358 




272 


OID 6298 


Importin beta subunit mRNA 


AI948513 


604 


655 


2433.5 




273 


PGKl 


Phosphogiycerate kinase 1 


NM 000291 


605 


655 


2059.5 




274 


TNFRSF10 
A 


Tumor necrosis factor receptor 
superfamily, member 10a 


NM_003844 


606 ! 


654 


4897.5 


1 


275 


ADM 


adrenomedullin 


NM 001124 


607 


654 


4235 




276 ! 


OID 357 ! 


138G5 1 


138G5 


608 


654 


5427.5 


1 1 


277 


C20orf6 


461A4 chromosome 20 open 
reading frame 6 


NM_0 16649 


609 


654 


6343 


1 


278 


OID 3226 


DKFZP564O0823 protein ! 


NM 015393 


610 


653 


6187.5 




279 


AS AH I 


N-acylsphingosine 
amidohydrolase (acid 
ceramidase) 1 


NMJ)04315 


611 


653 


1003 




ion 

280 


ATF5 


Activating transcription factor 5 


NM^O 12068 


612 


653 


4545.5 




281 ( 


DID_4887 


hypothetical protein MGC1 4376 ] 


NM_032895 


613 


653 


2310 


1 


282 < 


DID 4239 I 


EST ] 


BQ022840 < 


514 


552 ; 


2774.5 




283 I 


WDM? l 
( 
I 


viouse aouoie minute 2, nomolog 
3f; p53-binding protein (MDM2), 
ranscript variant MDM2, 


NM_002392 ( 


515 \ 


552 < 


4342 




284 : 


>£RN2 t 


>'-3' exoribonuclease 2 j 


VF064257 ( 


316 ( 


552 ( 


S896.5 
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285 


OID_6039 


Endothelial differentiation, 
lysophosphatidic acid G-protein- 
coupled receptor, 4 (EDG4) 


BE502246 


617 


652 


5147 




286 


OID 4210 


IMAGE:4540096 


AI300700 


618 


652 


1330.5 




287 


OID 7698 


EST 3' end 


AA243283 


619 


652 


7432.5 


1 


288 


PRKRA 


Protein kinase, interferon- 
inducible double stranded RNA 
dependent activator 


NM_003690 


620 


652 


3512.5 




289 


OID 4288 


IMAGE:2091815 


AI378046 


621 


651 


6401.5 




290 


OID 5620 


EST y end from T cells 


AW063678 


622 


651 


6400 




291 


OID 7384 


EST 5' 


BF475239 


623 


651 


6875 




292 


OID_1209 


EST Weakly similar to 
hypothetical protein FLJ20378 


C14379 


624 


651 


1356.5 


1 


293 


CDKN1B 


Cyclin-dependent kinase inhibitor 
IB (p27, Kipl) 


NM_004064 


625 


650 


4272.5 




294 


PLOD 


Procollagen-lysine, 2- 
oxoglutarate 5-dioxygenase 
(lysine hydroxylase, Ehlers- 
Danlos syndrome tvpe VI) 


NM_000302 


626 


650 


3101 




295 


OID 5128 


EST 


AK097845 


627 


650 


6476 




296 


OID 5877 


EST 3' end 


AW297664 


628 


650 


6864.5 


1 


297 


FZD4 


Frizzled (Drosophila) homolog 4 


NM_012193 


629 


650 


5816 




298 


HLA-B 


Major histocompatibility 
complex, class I, B 


NM_005514 


630 


650 


229 




299 


OID 5624 


EST 3' end from T cells 


AW063921 


631 


649 


7812.5 




300 


FPR1 


Formyl peptide receptor 1 


NM 002029 


632 


649 


1156.5 




301 


ODF2 


Outer dense fiber of sperm tails 2 


NM_1 53437 


633 ; 


649 


4982.5 




302 


OID_5150 


tg04g01.xl cDNA, 3* end 
/clone==IMAGE:2 107824 ! 


AI392793 


634 


649 


7638 




303 


OID 5639 


EST 3' end from T cells 


AW064243 


635 


648 


6805 


1 


304 


OID 6619 


469A10 


469A10 


636 


647 


7110 


1 


305 


OID 6933 


463C7, 4 EST hits. Aligned 


AI089520 


637 


647 


6880.5 


1 


306 


OID 7049 


480E2 


480E2 


638 


647 


7128.5 




307 


IL17C 


Interleukin 17C 


NM 013278 


639 


647 


6411.5 




308 


OID 5866 


EST 3* end 


BM684739 


640 


647 


6532 


1 


309 


CD44 


CD44 


AA9 16990 


641 


646 


4758 




310 


VPS45A 


Vacuolar protein sorting 45A 
(yeast) 


NM_007259 


642 


646 


3371 




311 


OID_4932 


aa92c03.rl Stratagene fetal 
retina 937202 cDNA clone 
IMAGE:838756 


AA457757 


643 


646 


6057 


1 


312 


OID 7821 


EST ] 


AA743221 


644 


645 


7507 




313 


OID_4916 


zr76a03.rl Soares_NhHMPu_Sl 
cDNA clone IMAGE:669292 


AA252909 


645 


645 


6962.5 


1 


314 


OID_4891 


Hypothetical protein LOC255488 


AL832329 


646 


645 


6148.5 
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31 j 


XT A r\TIT) 


Hyaroxyacyi-i^oenzyine /\ 
dehydrogenase/3-ketoacyl- 
Coenzyme A thiolase/enoyl- 
Coenzyme A hydratase 
(trifunctional protein), beta 
subunit 








-?Z 1 Z. J 




316 : 


FU22757 


Hypothetical protein FLJ22757 j 


NM 024898 


648 


644 


1965.5 


1 


317 


RACl 


Ras-related C3 botulinum toxin 
substrate 1 (rho family, small 
GTP binding protein Racl ) 


AJvlOnyVj 


o4y 


A4>i 

04*f 






318 


OID 64 15 


72D4, FLJ00290 protem 


L,A4U/zUl 


A<A 


A/1/1 
0*14 


/1QQ 1 




319 


NMES1 


Normal mucosa of esophagus 
specific 1 


NMJ)32413 


651 


644 


6217 


i 
i 


320 


DMBT1 


Deleted in malignant brain 
tumors 1, transcript variant 2 


NM_007329 


652 


644 


7284 




321 


RPS23 


ribosomal protein S23 


INJVI UUIUZj 




A/11 

04J 


O 1 o ^ 

Z IV. 3 




322 


ZF 


HCF-binding transcription factor 
Zhangfei 


iNIvl__UZlzlZ 


A^/l 
Oj4 


Ail 7 


4HAQ 




323 


NFE2L3 


Nuclear factor (erythroid-derived 
2j-like 3 


NM_004289 


655 


643 


3378 




324 


RAD 9 


RAD9 nomolog (o. pombe) 


JNlVl UU4:>o4 


a< a 








325 


OID 6295 


ESI 3 end 


AlooUoU/ 


OD / 


A/ll 
04J 






326 


DEFCAP 


Death effector filament-forming 
Ced-4-like apoptosis protein, 
transcript variant B 


NM_014922 


658 


643 


3059 




327 


RPL27A 


Ribosomal protein L27a 


Drzl414o 


A<Q 

o^y 


A/10 
04 Z 


A*;7i 


i 
i 


328 


1L22 


lnterleukin 22 (LLzz^ 


XTTV/f 

NM UZIOZD 


oou 


04Z 




i 
i 


329 


PSMA4 


Proteasome (prosome, 
macropainy suoumi, aipna iypc, 
4, (PSMA4) 


NMJ)02789 


661 


641 


1934.5 




330 


CCNI 


cyclin I (CCNI) 


NM 006835 


662 


641 


980.5 




331 


THBD 


Thrombomodulin 


NM 000361 


663 


640 


4732.5 




332 


CGR19 


Cell growth regulatory with ring 
linger domain 


NM 006568 


664 


640 


5510 
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PCR 


PCR 




PCR 


PCR 










Forward 


Reverse 


PCR 


Forward 


Reverse 


PCR 




SEQ ID 


SEQ ID 


Primer 1 


Primer 1 


Probe 1 


Primer 2 


Primer 2 


Probe 2 


Gene 


50mer 


RNA/cDNA 


SEQ ID 


SEQ ID 


SEQ ID 


SEQ ID 


SEQ ED 


SEQ ID 


HSRRN18S 


1 


333 


665 


996 


1327 








ACTB 


2 


334 


666 


997 


1328 








GUSB 


3 


335 


667 


998 


1329 


1656 


1Q04 

1 7V/*t 


71 S7 
z l _>z 


B2M 


4 


336 


668 


999 


1330 








TSN 


5 


337 


669 


1000 


1331 


1657 


ions 


71 SI 

z U J 


CCR7 


6 


338 


670 


1001 


1332 








IL1R2 


7 


339 


671 


1002 


1333 


1658 


ion£ 


71 S4 
Z 1 DH 


AIF-1 


8 


340 


672 


1003 


1334 








ALAS2 


9 


341 


673 


1004 


1335 








APELIN 


10 


342 


674 


1005 


1336 








CD80 


11 


343 


675 


1006 


1337 


1659 


1007 


Z 1HJ 


EPB41 


12 


344 


676 


1007 


1338 








CBLB 


13 


345 


677 


1008 


1339 


1660 


1 008 


71 Sfi 

Z 1 JO 


CCR5 


14 


346 


678 


1009 


1340 


1661 


100Q 


71 S7 
Zl J / 


MME 


15 


347 


679 


1010 


1341 


1662 


1010 


?1 SR 

Z 1 JO 


KLRC1 


16 


348 


680 


1011 


1342 


1663 


101 1 

17 11 


?1 SO 

Z 1J7 


FCGR3A 


17 


349 


681 


1012 


1343 








FCGR3B 


18 


350 


682 


1013 


1344 


1664 


101? 


?1 fin 

Z 1 Ov/ 


LAG3 


19 


351 


683 


1014 


1345 


1665 


1913 


7161 


PEC AMI 


20 


352 


684 


1015 


1346 


1666 


1014 


Z 1 VIZ 


CD34 


21 


353 


685 


1016 


1347 


1667 

i \J\J i 


101 s 


?1fil 

Z I Dj 


FCGR1A 


22 


354 


686 


1017 


1348 


1668 


1016 


zi o*t 


TFRC 


23 


355 


687 


1018 


1349 








CMA1 


24 


356 


688 


1019 


1350 


1669 


1017 


?16S 

Z 1 UJ 


KIT 


25 


357 


689 


1020 


1351 








MPL 


26 


358 


690 


1021 


1352 


1670 


1018 


Z 1 uu 


EphB6 


27 


359 


691 


1022 


1353 








EPO-R 


28 


360 


692 


1023 


1354 








Foxp3 


29 


361 


693 


1024 


1355 


1 671 


1010 


71 fi7 

Z ID / 


GATA-1 


30 


362 


694 


1025 


1356 








ITGA2B 


31 


363 


695 


1026 


1357 


1 67? 


1 070 


7 1 AR 

Z lOo 


GNLY 


32 


364 


696 


1027 


1358 


167"% 


1071 

1 Z? Z 1 


71 fiO 

Z 1 07 


GZMA 


33 


365 


697 


1028 


1359 


1674 


107? 1 


71 70 
Zl /u 


HBA 


34 


366 


698 


1029 


1360 




1071 I 

1 7£ D 


7171 

Z 1 / 1 


HBZ 


35 


367 


699 


1030 


1361 


1676 


1 074 


717? 
Z 1 /Z 


HBB 


36 


368 


700 


1031 


1362 


1*77 




71 71 
zl /J 


HBD 


37 


369 


701 i 


1032 ! 


1363 


1678 


1 0?fi 


7 1 74 
Zl /*♦ 


HBE 


38 


370 


702 


1033 


1364 




1077 


717S 

Z 1 / J 


HBG 


39 


371 


703 


1034 


1365 


1 680 


1078 i 
i 7zo 


71 7fi 
Z 1 / o 


HBQ 


40 


372 


704 


1035 


1366 ! 


1681 


1070 
i y&y 


7177 i 
Z 1 / / 


HLA-DP 


41 


373 


705 


1036 


1367 


1682 


1010 


7178 
Z 1 / o 


HLA-DQ 


42 


374 


706 


1037 


1368 


1681 


1011 
i yo i 


7170 

Zl /J* 


HLA-DRB 


43 


375 


707 


1038 


1369 


1684 


101? ! 


? 1 RO i' 
Z I OU 


ICOS 


44 


376 


708 


1039 


1370 


168S 


1011 
1 7 JJ 


7181 
Z 1 o 1 


IL18 


45 


377 


709 


1040 


1371 


1686 


1014 

1 7Jt 


71 87 
Z 1 oZ 


IL3 


46 


378 


710 


1041 


1372 


1687 


101*% 

1 7JJ 


71 81 

Z 1 Oj 


ITGA4 


47 | 


379 


71 1 


1042 


1373 








ITGAM 


48 


380 


712 


1043 


1374 


1688 
i vi o o 


1016 

l 7JU 


71 84 


ITGB7 


49 


381 


713 


1044 


1375 








CEBPB 


50 


382 


714 


1045 


1376 


1689 


1937 


2185 


NF-E2 


51 


383 


715 


1046 


1377 








PDCD1 


52 


384 


716 


1047 


1378 


1690 


1938 


2186 


PF4 


53 


385 


717 


1048 1 


1379 


1691 


1939 


2187 


PRKCQ 


54 


386 


718 


1049 


1380 


1692 


1940 


2188 j 


PPARGC1 


55 


387 


719 


1050 


1381 
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PCR 
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Reverse 


PCR 
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Reverse 


PCR 






oLy 11/ 


Primpr % 

XI 111ICI M. 


Primpr 1 


Probe 1 


Primer 2 


Primer 2 


Probe 2 


Gene 


50mer 


DMA /**Th'N A 

KJNA/CUINA 










crn in 

uLy LLP 




RAG1 


56 


388 


720 


1051 


1382 


1693 


1 O A 1 

1941 


2189 i 


RAG2 


57 


389 


721 


1052 


1383 


1694 


1942 


2190 


CXCL12 


58 


390 


722 


1053 


1384 


1695 


1943 


2191 


TNFRSF4 


59 


391 


723 


1054 


1385 


1696 


1944 


2192 


TNFSF4 


60 


392 


724 


1055 


1386 


1697 


1945 


2193 


TPS1 


61 


393 


725 


1056 


1387 


1698 


1946 


2194 


ADA 


62 


394 


726 


1057 


1388 


1699 


1947 


2195 


CPM 


63 


395 


727 


1058 


1389 


1700 


1948 


2196 


CSF2 


64 


396 


728 


1059 


1390 


1701 


1949 


2197 


CSF3 


65 


397 


729 


1060 


1391 


1702 


1950 


2198 1 


CRP 


66 


398 


730 


1061 i 


1392 


1703 


1951 


2199 


FLT3 


67 


399 


731 


1062 1 


1393 


1704 


1952 


2200 


GATA3 


68 


400 


732 


1063 


1394 


1705 


1953 j 


2201 


IL7R 


69 


401 


733 


1064 


1395 


1706 


1954 


2202 


KLF1 


70 


402 


734 


1065 


1396 


1707 


1955 


2203 


LCK 


71 


403 


735 


1066 


1397 


1708 


1956 


2204 


LEF1 


72 


404 


736 


1067 


1398 


1709 


1957 


2205 


PLAUR 


73 


405 


737 


1068 


1399 


1710 


1958 


2206 


TNFSF13B 


74 


406 


738 


1069 


1400 


1711 


1959 


2207 


IL8 


75 


407 


739 


1070 


1401 








GZMB 


76 


408 


740 


1071 


1402 








TNFSF6 


77 


409 


741 


1072 


1403 








TCIRG1 


78 


410 


742 


1073 


1404 








PRF1 


79 


411 


743 


1074 


1405 








IL4 


80 


412 


744 


1075 


1406 








IL13 


81 


413 


745 


1076 


1407 








CTLA4 


82 


414 


746 


1077 


1408 








CD8A 


83 


415 


747 


1078 


1409 








BY55 


84 


416 


748 


1079 


1410 








OID 4460 


85 


417 


749 


1080 


1411 








HBB 


86 


418 


750 


1081 


1412 








BPGM 


87 


419 


751 


1082 


1413 








MTHFD2 


88 


420 


752 


1083 


1414 








TAP1 


89 


421 


753 


1084 


1415 








KPNA6 


90 


422 


754 


1085 


1416 








OID 4365 


91 


423 


755 


1086 


1417 








IGHM 


92 


424 


756 


1087 


1418 








OID 573 


93 


425 


757 


1088 


1419 


1712 


1960 


2208 


OID 873 


94 


426 


758 


1089 


1420 








OID 3 


95 


427 


759 


1090 


1421 








CXCR4 


96 


428 


760 


1091 


1422 








CD69 


97 


429 


761 


1092 


1423 








CCL5 


98 


430 


762 


1093 


1424 








IL6 


99 


431 


763 


1094 


1425 








IL2 


100 


432 


764 


1095 


1426 








KLRF1 


101 


433 


765 


1096 


1427 








LYN 


102 


434 


766 


1097 


1428 








IL2RA 


103 


435 


767 


1098 


1429 








CCL4 


104 


436 


768 


1099 


1430 








OID 6207 


105 


437 


toy 


1 t C\f\ 
1 IUU 










ChGn 


106 


438 


770 


1101 


1432 








OID 4281 


107 


439 


771 


1102 


1433 








CXCL9 


108 


440 


772 


1103 


1434 








CXCL10 


109 


441 


773 


1104 


1435 








IL17 


110 


442 


774 


1105 


1436 
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SEQID 


SEQ ID 


Primer 1 


Primer 1 


Probe 1 


Primer 2 


Primer 2 


Probe 2 


Gene 


50mer 


RNA/cDNA 


SEQID 


SEQID 


SEQID 


SEQID 


SEQID 


SEQID 


IL15 


111 


443 


775 


1106 


1437 








IL10 


112 


444 


776 


1107 


1438 








IFNG 


113 


445 


777 


1108 


1439 


1713 


1061 

I 7 Kj 1 


9 900 
zzv/7 


HLA-DRB1 


114 


446 


778 


1109 


1440 


1714 

X / xt 


10^9 

1 70^ 


991 0 
zzi u 


CD8B1 


115 


447 


779 


1110 


1441 








CD4 


116 


448 


780 


1111 


1442 








CXCR3 


117 


449 


781 


1 1 12 


1443 








OID 7094 


118 


450 


782 


1113 


1444 








OID 7605 

VIA/ 9 


1 19 


451 


783 


1 1 14 

1 X Xt 


144S 








CXCL1 


120 


452 


784 


1115 


1446 








OID 253 


121 


453 


785 


1116 
1 x 1 u 


1447 

X *t t / 








GPI 


122 


454 


786 


1 1 17 


1448 








CD47 


123 


455 


787 


1118 


1440 








HLA-F 


124 


456 


788 


1 1 19 

J X X 7 


14S0 








OID 5350 


125 


457 


789 


1120 


1451 








TCRGC2 


126 


458 


790 


1121 


1452 








OID 7016 


127 


459 


791 


1 122 










PTGS2 


128 


460 


792 


1 123 


1454 








OID 5847 


129 


461 


793 


1124 


1455 








PRDM1 


130 


462 


794 


1 125 


1456 








CKB 


131 


463 


795 


1126 


1457 








TNNI3 


132 


464 


796 


1127 


1458 








TNNT2 


133 


465 


797 


1128 


1459 








MB 


134 


466 


798 


1129 


1460 








SLC7A11 


135 


467 


799 


1 130 


1461 








TNFRSF5 


136 


468 


800 


1131 

X X «7 X 


1462 


171S 


1061 


991 1 
zz 1 1 


TNFRSF7 


137 


469 


801 


1132 


1463 








CD86 


138 


470 


802 [ 


1 133 

X X ~J ~' 


1464 

i tut 








AIFlv2 


139 


471 


803 


1134 


1465 








EV BCLF-1 


140 


472 


804 


1 1 IS 


1466 
1 tuu 


171 f\ 
l no 


1 Qf%A 
i yon 


991 9 
ZZ I Z 


EV EBV 


141 


473 


805 


1 136 


1467 
x tu / 


1717 
i / 1 1 


1 Q£S 

1 7UJ 


991 1 


CMV o67 


142 


474 


806 


1137 


1468 

ituo 


171 8 

1 / 1 o 


1 700 


9914 

ZZ 1H 


CMV TRL7 


143 


475 


807 


1 138 

X X JO 


1469 

1 tU7 


1710 

1/17 


1067 


991 S 
ZZ 1 3 


CMVIEle3 


144 


476 


808 


1 139 


1470 

It/v 


1770 


1068 

1 7UO 


991*5 
zz 1 0 


CMV IEle4 


145 


477 


809 


1 140 


1471 


1791 
i / z i 


10£Q 


99 1 7 
zz 1 / 


EV EBNA-1 


146 


478 


810 


1 141 


1479 

X *T / Z 


1 799 

1 /ZZ 


1 070 
1 y / v 


99 1 R 
ZZ 1 O 


EV BZLF-1 


147 


479 


81 1 


1 142 I 


1471 

X*t / J 


1791 

1 / ZJ 


1071 

17 / 1 


99 10 

ZZ 1 7 


EV EBN 


148 


480 


812 


1 143 


1474 


Mid 
1 (At 


1079 

I7/a 


9990 
zzzu 


EV EBNA-L 


149 


481 


813 


1 144 


1475 








CMVIE1 


150 


482 


814 


1145 


1476 

It / \J 


179S 


1071 

1 7 I J 


9991 
zzz 1 


CMV IE1 


151 


483 


815 


1 146 


1477 








CLC 


152 


484 


816 


1 147 


1478 

l t / O 


1796 


1074 

1 7 1 1 


9999 


TERF2IP 


153 


485 


817 


1 148 


1479 


1797 

1 / Z f 


107S 

17/J 


9991 

ZZZ-J 


HLA-A 


154 


486 


818 


1 149 


1480 

1 tou 


1798 

1 'AO 


107fi 

17/U 


9994 

ZZZH- 


OID 5891 


155 


487 


819 


1 1 so 

X 1 J v 


1481 

I to X 


1790 

1 / Z7 


1077 
\y 1 / 


999S 
ZZZ j 


MSCP 


156 


488 


820 


1 1 SI 


1489 
i toz 


1710 
x /jU 


1078 
1 y 1 0 


979A 
ZzzO 


DUSP5 


157 


489 


821 


1 1 S9 


1481 


1711 ! 
1 t j I 


1070 


9997 
ZZZ / 


PRO 1853 


158 


490 


822 


1 1 S3 


1484 


1719 

1 1 ~>Z 


1 080 

1 70U 


9998 
ZZZO 


OID 6420 


159 


491 


823 


1 1 S4 

X X J*T 


1485 


1711 


1081 
1 70 1 


9990 

ZZZ7 


CDSN 


160 


492 


824 


1155 


1486 


1734 


1982 


2230 


OID 4269 


161 


493 


825 


1156 


1487 


1735 


1983 


2231 


RPS25 


162 


494 


826 


1157 


1488 


1736 


1984 


2232 


GAPD 


163 


495 


827 


1158 


1489 


1737 


1985 


2233 


RPLP1 


164 


496 | 


828 


1159 


1490 


1738 


1986 


2234 


OID 5115 


165 


497 


829 


1160 


1491 


1739 


1987 


2235 
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PCR 


PCR 
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PCR 
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SEQID 


SEQID 


Primer 1 


Primer 1 


Probe 1 


Primer 2 


Primer 2 


Probe 2 


Gene 


SOmer 


RNA/cDNA 


SEQID 


SEQID 


SEQID 


SEQID 


SEQID 


SEQID 


CT f^Q A ft 


1 66 


/IOO 

4Vo 


o3U 


1 1U1 


140? 


1 740 f 


1 0ftft 

1700 


??16 


Ull/ 1 3 1 z 


167 


/1QO 

**yy 


jni i 
0 j 1 


1 162 


149^ 


1741 


i yoy 


??17 


•dot oon 


loo 


CAA 

30U 


1 


1 \f\\ 
1 I U-5 


1404 


174? 


1 oon 






1 AO 

ioy 


jU1 


ft'*'* 1 


1 1 64 

1 1 U*T 




174^ 


1QQ1 
177 1 


??10 

At J7 


U1JLJ 6Z5Z 


1 7A 
1 /0 


c ao 
3UZ 


03*f 


1 1 6S 


1 406 
1 tyo 


1 744 


IOO? 


??4H 


rKU 1 073 


1 "7 1 

1 71 


503 


ftl* 
533 


1 1 OO 


my t 


1 74 


1 001 
iyyj 


??41 

ZZH 1 


/~\TT\ TITO 

ULU /22z 


1 OO 

172 


CA/i 

304 


COA 1 
530 


1 1 67 


14Qft 
iHyQ 


1 746 
1 /HO 


1 OQd 

lyy** 


??4? 


I7T>T>T 1 

rPRLl 


173 


505 


00 / 


1 1 6£ 
1 1 Oo 


1 /IOO 


1 7/17 


1 00^ 
iyy3 




pi/ T>T>T 

FKBPL 


174 


506 


COQ 

535 


1 1 AO 

1 loy 


13UU 


1 7/lft 
1 /45 


1 0OA 

iyyo 




nn T7T> 

PREB 


175 


CAT 

50/ 


QOQ 


I 1 OA 

I I /u 


1 cm 

13U1 


1 7ilO I 

1 /**y 


1 Q07 

iyy / 


7?4S 


OID 1551 


176 


CAO 

508 


54U 


1101 
11/1 


1 CAO 
1 3UZ 


1 7CA 
1 /3U 


1 QQft 

iyys 


7746 


OID 7595 


177 


CAA 

509 


841 


I 1 00 

I I /z 


1 CAO 
13U3 


1 7C 1 
1 /31 


1 000 
iyyy 


??47 

ZZH / 


RNF19 


t TO 

178 


CIA 

510 


Q/IO 
54 Z 


1 1 00 
11/3 


13U4 


1 7^? 
1 /3Z 


?nnn 




SMCY 


1 "7A 

179 


511 


0/40 I 
543 


11/4 


1 CAC 
13U3 


1 7CO 
1 /33 


?nnt 
zuvji 


??40 


OLD 4184 


1 OA 

180 


CIO 

512 


Q.AA 

544 


1 1 0C 
11/3 


1 CAA 
1 3U0 


1 7C/1 
1 /34 


onn? 
zuuz 




OID 7504 


181 


513 


Q/1C 
543 


1 1 OA 
11/0 


1 CAO 
13U / 


1 /33 


?nni 

ZUU3 


??S1 T 


T\\T JL T/"»"> 

DNAJC3 


182 


C 1 A 

514 


Q/l A 

846 


1 1 00 

11// 


1 CAQ 

1305 


1 7CA 
1 /30 


0AA4 


7?S? 
ZZ3Z 


A T> T TT T 

ARHU 


183 


515 


54/ 


1 1 OQ 

11/5 


1 CAO 

i3uy 


1 7C7 
1 /3 / 


?nn^ 

ZUU3 


77S"% 


OID 7200 


1 OA 

184 


516 


ft/lQ 

5**5 


I 1 00 

I I /y 


1 C 1 A 
13 1U 


1 7^R 
1 /35 


7006 


??^4 


OT7T> T1TKTD O 

SERPINB2 


185 


en 
51 / 


54y 


1 1 OA 
1 15U 


1 c 1 1 
1311 








t:\TA 1 

bNUl 


1 Q/£ 
186 


315 


53U 


1151 


1 3 1Z 


1 7^0 

1 /3y 


?no7 


??SS 


nin oaoa 

uiL/ /oyo 


1 Q7 
15 / 


CIO 

3 iy 


5 j 1 


I 1 ft9 

I I oZ 


1^1^ 


1 760 
1 l\>\J 


?oor 

Z.UVJO 




OID 4173 


188 


OA 

5ZU 


QCO 

53Z 


1 1 00 
1 153 


131^ 


1 7A1 
1 /01 


?nno 
zuuy 


??<%7 1 




ion 
189 


CO 1 
3Z1 


QCO 

5D3 


1 1 QA 
1 154 


1 ci c 
1313 


1 7AO 
1 /OZ 


?ni n 

ZU 1U 


??^R 


OID 74 1U 


iyu 


coo 


5D4 


1 1 QC 
1 1 53 


1 C 1 A 
13IO 


1 761 
1 /03 


?ni 1 
zui 1 


??SO 


OID 4180 


191 


coo 
3Z3 


QCC 

5jj 


1 1 OA 
1 150 


1 C 1 0 
131 / 


1 764 
1 /OH 


?fil ? 

ZU 1Z 


??6f> 1 


/"MT"V CI A1 

OID 5101 


192 


CO/I 

3Z4 


5jO 


1 1 CO 
115/ 


1 C 1 ft 
1315 


1 76^ 
1 /03 


?ni 1 

ZU13 


??61 


MOP3 


1 A"! 

193 


coc 

JZJ 


53 / 


1 1 CO 
1 155 


1 C1Q 

13 iy 


1 766 
1 /OO 


?m 4 

ZU 1** 


??6? 


T>T>T 1 O A 


iy4 


COA 


535 


1 1 ftO 
1 15U 


1 3ZU 


1 767 
1 /O / 


?ni s 

Zlsi 3 


??6^ 


TXTT>"T» C A 

INPP5A 


1 AC 

195 


coo 
52/ 


QCO 
53V 


I 1 OA 

I I y\J 


1 CO 1 
13Z1 


1 7Aft 
1 /05 


?ni 6 
zuio 


OOAzl 
ZZOH 


hIAN7 


196 


coo 
528 


O^A 
8OU 


1101 

1 iy 1 


1 COO 

13ZZ 


1 7AO 

1 /oy 


oAi 7 
zui / 


ZZ03 


RPS29 


197 


COA 

529 


861 


1 1 00 
1 iyz 


1 coo 
13Z3 


1 OOA 

1 / /u 


OAI ft 
ZUI 5 


??AA 
ZZOO 


OID 6008 


1 AO 

198 


530 


O/CO 

862 


1 1 00 
1 iy3 


1 CO/1 

13Z4 


1 771 
1 / / 1 


OAI O 

zuiy 


??67 
ZZO / 


/"*VTT"\ A 1 O ^ 

OID 4186 


1 AA 

199 


CO 1 

531 


o<o 
503 


1 1 QA 

1 iy4 


1 coc 
13Z3 


1 770 
1 / /Z 


onon 
zuzu 


??6ft 
ZZOO 


VNN2 


1 AA 

200 


coo 
532 


OA/1 

864 


1 1 oc 

1 iy3 


1 COA 

13Z0 


1 700 
1 / /3 


oaoi 
zuzi 


??60 

zzoy 


OID 7703 


1A1 

201 


coo 
533 


QAC 

503 


1 1 OA 

1 iyo 


1 CO"7 

13Z / 


1 77/1 
1 / /*♦ 


on?o 


??7H 
zz / u 


/^kTT\ OACO 

OID 7057 


1A0 
202 


CO/1 

534 


OAA 

800 


1 1 oo 


1 COQ 

13Z5 


1 77C 
1 / /3 


0A00 
ZUZ3 


??71 
zz / 1 


U1D 4291 


OAl 

203 


coc 
535 


50/ 


1 1 OQ 

1 1V5 


1 COO 
13Zy 


1 77A 
1 / /O 


?n?4 
zuz*i 


??77 
ZZ / z 


OID 136o 


OA/1 

204 


co a 
536 


OAO 

505 


1 1 00 

1 ivy 


1 COA 

1330 


1 077 
if// 


OAO< 
ZUZ3 


??7^ 
ZZ / 3 


tree 1 a 1 
bJbr 1A1 


205 


coo 
53 / 


OAO 

509 


1 OAA 

1 zuu 


1 CO 1 
1331 


1 77ft 
1 / /5 


onoA 
zuzo 


??74 i 

ZZ /*T 


T> A OZ""M 

PA2U4 


ZOO 


CO 0 
538 


OO A 

5/U 


1 oni 

1ZU1 


1 COO 
1 33Z 


1 770 

1 / /y 


?n?7 
zuz / 


??7^ 
Z^ / 3 


UArD 


OAO 

20/ 


coo 
539 


ooi 
5/1 


1 0A0 
1ZUZ 


1 COO 

1333 


1 7CA 
1 /5U 


?n?ft 

ZUZ5 


??7a; 
zz / 0 


OUT^/I 
^J1JJ4 


ZUS 


C/l A 

34U 


000 
5/Z 


1 OAO 
1ZU3 


1 CO/i 
133** 


1 7ft 1 
1 /51 


?n?o 
zuzy 


??77 


/^vT*r\ ooc 1 
U1D /95 1 


OAO 
Z09 


C/f 1 

541 


000 
5/3 


1 OA/1 
1ZU4 


1 COC 
1333 


1 7CO 
1 /5Z 


ZU3U 




t\ Ant 

DAB1 


T1 A 

210 


C/IO 

542 


OO/l 

8/4 


1 OAC 
1ZU3 


1 COA 
1330 


1 7QO 
1 /53 


OA1 1 
ZU3 1 


??70 
zz /y 


OID 3406 


21 1 


C/l 0 

543 


00c 
875 


1 OA<C 

1206 


1 coo 
1537 


1 OQ/i 
1 /54 


OAOO 


??ftn 

ZZ5U 


OID 6986 


212 


C/l /< 

544 


00^ 
876 


1 OAO 

1207 


1 coo 
1335 


1 7QC 

1 /53 


OAOO 
ZU33 


??fti 

ZZO 1 


OID 5962 


213 


C/l c 

545 


on 

877 


t OAO 

1208 


1 COA 

1539 


1 OQA 

1 786 


OAO/1 
ZU34 


??ft? 
ZZ5Z 


OID 5 I 52 


214 


546 


OTO 

878 


1209 


1540 


1 000 

178/ 


OAOC 
Z033 


OOftO 
ZZ53 


S100A8 

O I vV/AO 


215 


547 


879 


1210 


1541 


1788 


2036 


2284 


HNRPU 


216 


548 


880 


1211 


1542 


1789 


2037 


2285 


ERCC5 


217 


549 


881 


1212 


1543 


1790 


2038 


2286 


RPS27 


218 


550 


882 


1213 


1544 


1791 


2039 


2287 


ACRC 


219 


551 


883 


1214 


1545 


1792 


2040 


2288 


PSMDll 


220 


552 


884 


1215 


1546 


1793 


2041 


2289 
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Reverse 
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PCR 






oJL\£ LLP 


Primal* 1 

r rimer i 


Primpr 1 

X 1 unci X 


Probe 1 


Primer 2 


Primer 2 


Probe 2 


Gene 


50mer 


RiNA/CDlN A 


SEQ ID 


Qirn in 
oi_iV<: I LP 


Ottt\l LLP 


&tLi\g LLP 


cvn in 




OLD 1016 


221 ! 


553 


885 


1216 


1547 


1794 


2042 


haa 
2290 


OID 1309 


222 ■ 


554 ! 


886 


1217 


1548 


1795 


2043 


2291 


OID 7582 


223 


555 


887 I 


1218 


1549 


1796 


2044 


2292 


OID 4317 


224 


556 


888 


1219 


1550 


1797 


2045 


2293 


OID 5889 


225 


557 


889 


1220 


1551 


1798 


2046 


11 A>1 

2294 


UBL1 


226 


558 


890 


1221 


1552 


1799 


2047 


Tine 

2295 


OID 3687 


227 


559 


891 


1222 


1553 


1800 


2048 


229o 


OID 7371 


228 


560 


892 


1223 ! 


1554 


1801 


2049 


229/ 


SH3BGRL3 


229 


561 


893 


1224 


1555 


1802 


2050 


229o 


SEMA7A 


230 


562 


894 


1225 


1556 


1803 


2051 


linn 
2299 


OID 5708 


231 


563 


895 


1226 


1557 


1804 


2052 


2300 


OID 5992 


232 


564 


896 


1227 


1558 


1805 


2053 1 


11 A1 

2301 


IL21 


233 


565 


897 j 


1228 


1559 


1806 


2054 


2302 


HERC3 


234 


566 


898 


1229 


1560 


1807 


2055 


11A1 

2303 


OID 7799 


235 


567 


899 


1230 


1561 


1808 


TAf£ 

2056 


11 A/1 

Z3U4 


Pll 


236 


568 


900 


1231 


1562 


1809 


1/\ C*7 

2057 


Z3U3 , 


OID 7766 ! 


237 


569 


901 


1232 


1563 


1810 


2058 


11 AiC 

230O ! 


TIMM10 


238 


570 


902 


1233 


1564 


1811 


2059 


230 / 


EGLN1 


239 


571 


903 


1234 


1565 


1812 


2060 


11 AQ 

Z3Uo 


race 


240 


572 


904 


1235 


1566 * 


1813 


20ol 


11AO i 

Z3uy 


RNF3 


241 


573 


905 


1236 ! 


1567 


1814 


1 A^l 

2062 


11 1 A 

2310 


OID 6451 


242 


574 


906 


1237 


1568 


1815 


2063 


11 1 1 

231 1 


CCNDBP1 


243 


575 


907 


1238 


1569 


1816 


2064 


1111 
Z 3 1Z 


OID 8063 


244 


576 


908 


1239 


1570 


1817 


20O3 


11 1 "2 

Z3 13 


SUV39H1 


245 


577 


909 


1240 


1571 


1818 


2066 


11 1 A 

Z3 14 


HSPC048 


246 


578 


910 


1241 


1572 


1819 


2067 


2315 


OID 5625 


247 


579 


911 


1242 


1573 


1820 


2068 


11 1 A 

23 lo 


WARS 


248 


580 


912 


1243 


1574 


1821 


2069 


ii 1 1 
231 / 


OID 6823 


249 


581 


913 


1244 


1575 


1822 


2070 


11 1 o 

23 lo 


OID 7073 


250 


582 


914 


1245 


1576 


1823 


2071 


111 A 

2319 


OID 5339 


251 


583 


915 


1246 


1577 


1824 


2072 


111A 

2320 


OID 4263 


252 


584 


916 


1247 


1578 


1825 


2073 


in i 
2321 


MGC26766 


253 


585 


917 


1248 


1579 


1826 


2074 


1111 

2322 


SERPINB11 


254 


586 


918 


1249 


1580 


1827 


2075 


nil 

2323 


OID 6711 


255 


587 


919 


1250 


1581 


1828 


2076 


iii/i 
2324 


RNF10 


256 


588 


920 


1251 


1582 


1829 


2077 


ni^ 
2323 


MKRN1 


257 


589 


921 


1252 


1583 


1830 


<<iAn o 

2078 


nix 
232o 


RPS16 


258 


590 


922 


1253 


1584 


1831 


2079 


2327 


BAZ1A 


259 


591 


923 


1254 


1585 


1832 


2080 


111Q 

232o 


OID 5998 


260 


592 


924 


1255 


1586 


1833 


2081 


11 1 A 

2329 


ATP5L 


261 


593 


925 


1256 


1587 


1834 


2082 


11 "J A 

2330 


OID 6393 


262 


594 


926 


1257 


1588 








RoXaN 


263 


595 


927 


1258 


1589 


1835 


2083 


in i 

2331 


NCBP2 


264 


596 


928 


1259 


1590 


1836 


2084 


1111 
2.551. 


OID 6273 


265 


597 


929 


1260 


1591 


1837 


2085 


in i 
2333 


HZF12 


266 


598 


930 


1261 


1592 


1838 


2086 


11 1 A 

2334 


CCL3 


267 


599 


931 


1262 


1593 


1839 


2087 


11 1 c 

2333 


OID 4323 


268 


600 


932 


1263 


1594 


1840 


2088 


2336 


OID 5181 


269 


601 














PRDX4 


270 


/cai 

OOZ 


Oil 




1595 


1841 


2089 


2337 


BTK 


271 


603 


934 


1265 


1596 


1842 


2090 


2338 


OID 6298 


272 


604 


935 


1266 


1597 


1843 


2091 


2339 


PGK1 


273 


605 


936 


1267 


1598 


1844 


2092 


2340 


TNFRSF10/ 


274 


606 


937 


1268 


1599 


1845 


2093 


2341 


ADM 


275 


607 


938 


1269 


1600 


1846 


2094 


2342 
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PCR 


PCR 




PCR 


PCR 










Forward 


Reverse 


PCR 


Forward 


Reverse 


PCR 




SEQ ID 


SEQ ID 


Primer 1 


Primer 1 


Probe 1 


Primer 2 


Primer 2 


Probe 2 






RNA/cDNA 


SEO ID 


SEQ ID 


SEQ ID 


SEQ ID 


SEO ID 


SEQ ID 


ATT\ OCT 

OLD 357 


276 


608 


mo 


1 77A 
1Z /U 


1 AOI 
lOv I 


1 5*1 / 


OAQ< 


Z343 


CzOorfo 


277 


609 


94U 


1 11 1 
1Z 1 1 


1 £A1 

i ouz 


1 QAQ 




17/1 /t 
Z344 


OID 3226 


278 


610 


941 


1 111 

12/2 


1 £A7 
10U3 


1 €MO 

1 B4V 


*>AAT 

ziiy / 


ii/i < 
Z34j 


A C A Til 

ASAH1 


*»*TA 

279 


61 1 


A/11 

942 


1 111 


1 AH/1 
10U4 


15DU 


OAOQ 


ll/l fi 
Z340 


ATF5 


280 


612 


943 


1 1*7 >1 

1274 


101O 


lo J 1 


2U99 


11/1*7 

234/ 


OID 4887 


281 


1 1 

613 


944 


1275 


1 £A£ 


IojZ 


O 1 AA 

21UU 


11/1Q 
Z34o 


OID 4239 


282 


614 


945 


1276 


1 AA1 

16U/ 


1 C^l 


1 1 A1 

21U1 


11/1Q 
Z34y 


MDM2 


283 


615 


946 


1277 


1608 


1854 


1 1 AO 

21U2 


11CA i 
Z33U 


XRN2 


284 


616 


947 


1278 


1609 


iocc 
1855 


n l ai 


IK 1 

2351 


OID 6039 


285 


617 


948 


1279 


1£1 A 

1610 


1856 


O 1 A/i 

21U4 


11C1 

235z 


OID 4210 


286 


618 


949 


1280 


i/i i i 
1611 


1857 


O t AC 

21U5 


11C1 

2353 


OID 7698 


287 


619 


950 


1281 


i ^ 1 1 
1612 


1S58 


O 1 C\tZ 

Z 1UO 


11C/1 

Z3j4 


PRKRA 


288 


620 


951 


1 T O '"l 

1282 


1613 


1 OCA 


1 1 AT 


nee 
Z3DD 


OID 4288 


289 


621 


952 


1283 


1614 


186U 


1 1 AQ 

zlUo 


11CA 

Z330 


OID 5620 


290 


622 


953 


1 ID/I 

1284 


1615 


1861 


1 1 AQ 


1K7 

Z33 / ] 


OID 7384 


291 


623 


954 


1 1 o c 

1285 


1616 


1862 


11 1 A 

2110 


7KQ 
Z3DO 


OID 1209 


292 


624 


AC c 

955 


1286 


101 / 


loOJ 


Zl 1 1 


Z33y 


CDKN1B 


293 


625 


956 


1 1 0*7 

1287 


lolo 


1 Q£LA 
1504 


Zl 1Z 


Z30U 


PLOD 


294 


626 


AC"7 

957 


1AOO 

1288 


1619 


loOD 


Z 1 1.5 


Z301 i 


OID 5128 


295 


oil 


958 


1 ion 
1289 


1 <1A 

lOZU 


1 Q <A 
loOO 


1 1 1 A 
Z 1 lH 


Z30Z 


OID 5877 


296 


628 


959 


1290 


1/11 

1621 


loo / 


lite 
2115 


Z303 


FZD4 


297 


629 


a jca 

960 


1 i a 1 
1291 


lozz 


1 Q£Q 

loOo 


111/5 
Z 1 10 


Z304 


HLA-B 


298 


630 


961 


1292 


10Z3 


1 QAO 

looy 


1117 1 
Z 11 1 


Z303 


OID 5624 


299 


631 


962 


1293 


1624 


1 0*7A 
lo /U 


1 1 1 Q 
Zl lo 


Z300 


FPR1 


300 


632 


a/*i 
963 


1294 


lOZJ 


lo / 1 


1 1 1 O 

z i iy 


71^7 
Z30 / 


ODF2 


301 


633 


964 


1 1AC 

1295 


1626 


lo 11 


1 1 1A 

Z 1ZU 


71 AQ 
Z30o 


OID 51 50 


302 


634 


965 


1 ia/c 
1296 


162/ 


1 0*71 

lo 15 


1111 
Z1Z1 


71AQ 
Z30y 


OID 5639 


303 ' 


635 


966 


1297 


162o 


1 C"7/1 
lO fH 


1 1 11 
Z1ZZ 


Z3 /U i 


OID 6619 


304 


636 


967 


1298 


1629 


lo /5 


1 1 11 
Z1ZJ 


7171 i 
Z3 / 1 


OID 6933 


305 


637 


968 


1299 


1630 


1876 


1 1 1/1 
Z1Z4 


11*71 
Z3 /Z 


OID 7049 


306 


638 


969 


1 1 r\f\ 

1300 


1631 


1877 


1 1 1C 

2125 


11*71 

23/3 


IL17C 


307 


639 


970 


1301 


1632 


1 0*70 

1878 


1 1 iii 
2126 


11*7/1 

Z3 /4 


OID 5866 


308 


640 


971 


1 iai 
1302 


1633 


1879 


1 1 n 
212/ 


117C ' 

Z3 / J 


CD44 


309 


641 


972 


1 TAl 

1 303 


1634 


1 OOA 

1880 


1 1 1Q 

2128 


11*7/; 
Z3 /0 


VPS45A 


310 


642 


973 


1304 


1 635 


1 QQ1 

1881 


1 1 1Q 

zizy 


11*7*7 

Z3 / / 


OID 4932 


311 


643 


974 


1305 


1636 


looz 


1 1 1A 
Z 1 3U 


117Q 
Z3 /o 


OID 7821 


312 


£. A A 

644 


975 


1306 


1637 


1883 


nil 
Z 13 1 


71"7Q 

Z3 /y 


OID 4916 


313 


645 


976 


1 1A1 

1 307 


1638 


1 QQA 

loo4 


1 1 11 
Zl JZ 


71QA 
Z30O 


OID 4891 


314 


hah 
646 


977 


1308 


1639 


1 OOJ 


1 1 11 
Zl 


11Q1 
Z3ol 


HADHB 


315 


647 


978 


1 1AA 

1309 


low 


1 OOO 


7 1 1/1 
Z 1 J*f 


71Q1 
Z30Z 


T"**ir *S IT T 

FLJ22757 


316 


648 


979 


1310 


1641 


188 / 


one 
Zl 33 


11Q1 i 

Z3o3 ! 


RAC1 


317 


649 


980 


1311 


1642 


1 QQQ 
I OOO 


1 1 1A 
Z 130 


77Q/1 1 
Z3 54 


OID 6415 


318 


650 


981 


1312 


1643 


1 QQQ 

1889 


1117 
Zl J / 


71QC 

Z3oZ> 


NMES1 


319 


651 


982 


1313 


1644 


1890 


1 1 1Q 

2138 


no/*: 
2386 


DMBT1 


320 


652 


983 


1314 


1645 


Y OOI 

1891 


1 1 1Q 

2i3y 


11Q1 

Z3o / 


RPS23 


321 


653 


984 


1315 


1646 


1 OAO 

1892 


1 1 A A 

214U 


11 QQ 

2388 


ZF 


322 


654 


985 


1316 


1647 


1 OOI 

1893 


11/11 
2141 


11QQ 

Z3<sy 


NFE2L3 


323 


655 


AO £. 

986 


nn 

1317 


1 HA O 

1648 


1894 


ii/ii 
Z14Z 


17 AA 

Z3yu 


15 A TVA 


324 


656 


HOT 

987 


mo 
I 3 lo 




l oyj 


1 l/ll 
Z 143 


7101 

Z3y i 


OID 6295 


325 


657 


988 


1319 


1650 


1896 


2144 


2392 


DEFCAP 


326 


658 


989 


1320 


1651 


1897 


2145 


2393 


RPL27A 


327 


659 


990 


1321 


1652 


1898 


2146 


2394 


IL22 


328 


660 


991 


1322 


1653 


1899 


2147 


2395 


PSMA4 


329 


661 


992 


1323 


1654 


1900 


2148 


2396 


CCNI 


330 


662 


993 


1324 


1655 


1901 


2149 


2397 
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Gene 


SEQ ID 
50mer 


SEQ ID 
RNA/cDNA 


PCR 
Forward 
Primer 1 
SEQID 


PCR 
Reverse 
Primer 1 
SEQ ID 


PCR 
Probe 1 
SEQ ID 


PCR 
Forward 
Primer 2 
SEQID 


PCR 
Reverse 
Primer 2 
SEQID 


PCR 
Probe 2 
SEQID 


THBD 


331 


663 


994 


1325 


1656 


1902 


2150 


2398 


CGR19 


332 


664 


995 


1326 


1657 


1903 


2151 


2399 
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Gene Name 


SEQ ID 
SOmer 


SEQ ID 
RNA/cDNA 


n 


Non- 
parametric 
Odds ratio 


Fisher p- 
value 


t-test p- 
value 


unn 


HpmnolnKtn V>PtA 


ou 


no 


55 


8.33 


0.00 


0.00 


OID_4365 


Mitochondrial solute 

Loll 1C-1 


91 


423 


53 


6.16 


0.00 


0.00 


OID 873 


KIAA1892 protein 


94 


426 


55 


5.09 


0.01 


0.01 


IL4 


Interleukin 4 


80 


412 


46 


4.90 


0.02 


0.01 


nrn 4?ri 


CO J. 


107 


439 


56 


5.19 


0.01 


0.01 


IGHM 


Immunoglobulin heavy 
constant mu 


92 


424 


52 


2.89 


0.09 


0.01 


DIT VJ1V1 


mutase 


87 


419 


43 


7.31 


0.01 


0.01 


CTLA4 


Cytotoxic T^ymphocyte- 
associaicu proicin *t 


82 


414 


52 


1.84 




0.02 


ct r*n Ail 
olAs /J\ 1 1 


O r\lnfo wafT l or m tlx/ / 

ooiuic earner ianuiy / , 
(cationic amino acid 
transporter, yr sysiernj 
member 1 1 


1 IS 


467 


48 

to 


2.50 


0.15 


0.03 


IL13 


Interleukin 13 


81 


413 


29 


4.95 


0.07 


0.04 ! 


run ^?07 


EO 1 


105 


437 


37 


3.58 


0.10 


0.04 


PRDM1 


PR domain containing 1 , 
wiin ZiiNr oomain 


130 


462 


57 


1.44 




0.07 


T VM 


V-yC5>- i I alilagllCIil 

Sarcoma viroi rciateu 
oncogene homolog 


102 


434 


55 


1.08 




0.08 




£V.aI yupilCI Hi aipila V7 

(importin alpha 7) 


QO 

7u 


422 


51 


1.50 




0.09 


OID 7094 


XDx EST 479G12 


118 


450 


35 


1.13 




0.09 


IL15 


Interleukin 15 


111 


443 


51 


3.78 


0.05 


0.09 


OID 4460 


EST 


85 


417 


47 


2.73 


0.14 


0.10 


OID 7016 


EST 


127 


459 


53 


2.14 


0.27 


0.10 


ivi i or uz 


1\ VT on o 

ivieinyiene 
tetrahydrofolate 
dehydrogenase (NAD+ 
dependent), 

methenyltetrahydrofolate 
CYclohYdrola.se. 


oo 


420 


43 


3.50 


0.07 


0.11 


TCIRG1 


T-cell, immune regulator 

1 ATPacp 14 -t- 

transporting, lysosomal 
V0 protein a isoform 3 


78 


410 


57 


1.08 




0.11 




nypouiencai proicin 

ITT mOI O 




*+o 1 


*T«7 


1 OR 

1 .V70 




0.12 




f"Tii»Tn#>ViTi*» (C* Y-f^ mnh' 

v^ncmoKiric ^i^-.a.-\-> muiij 


7O 


49 R 


S6 


1 99 




0.12 


fVfDl 


v^ncmoKine inoii. 


1 1 7 


44Q 


54 


2.10 


0 27 


0.12 


<tpt 

Ux 1 


fill i/»Aor» ntmcnfinfp icnmp 

vjiucose pnospjiaic ISOITlC 


199 


4S4 


S7 


1.44 


0 60 

V/.V7V/ 


0,12 


KLRF1 


Killer cell lectin- like rece 


101 


433 


50 


1.68 




0.13 


CCL5 


Chemokine (C-C motif) 1 


98 


430 


34 


1.96 




0.13 


CD47 


CD47 antigen (Rh-relatec 


123 


455 


55 


1.45 




0.13 


IL10 


Interleukin 10 


112 


444 


33 


1.43 




0.13 


OID 253 


EST 


121 


453 


26 


1.93 




0.15 


CXCL10 


Chemokine (C-X-C motif 


109 


441 


53 


1.75 




0.16 
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Gene 


Gene Name 


SUmer 


DVA /«T\T\J A 

KINA/ClilNA 


n 


pton- 

parametric 
Odds ratio 


risner p- 
value 


i-iesi p- 
value 


IFNG 


Interferon, gamma 


113 


445 


41 


1.33 




0.16 


PRF1 


Perforin 1 (pore forming 


79 


411 


A O 

48 


1.20 




0.17 


IL2 


Interleukin 2 


100 


432 


33 


2.00 




0.17 


HLA-DRB1 


Major histocompatibility 


114 


446 


42 


1.50 




0.18 


IL6 


Interleukin 6 


99 


431 \ 


49 


1.33 




0.18 


IL2RA 


Interleukin 2 receptor, 
alpha 


103 


435 


39 


2.03 


0.34 


0.19 


OID 573 


KIAA1486 protein 


93 


425 


8 


3.00 




0.19 


CXCL9 


Chemokine (C-X-C 
motif) ligand 9 (MIG) 


108 


440 


46 


1.71 




0.20 


OID 3 


EST 


95 


427 


49 


2.19 




0.20 


CD8B1 


CD8 antigen, beta 
polypeptide 1 (p37) 


115 


447 


55 


1.21 




0.22 


CD69 


CD69 antigen (p60, 
early T-cell activation 
antigen) 


97 


429 


30 


1.71 




0.23 


OID 7605 


EST 


119 


451 


47 


3.11 


0.08 


0.24 


TNFSF6 


Tumor necrosis factor 
(ligand) superfamily, 
member 6 


77 


409 


54 


1.36 




0.25 


CXCL1 


Chemokine (C-X-C 
motif) ligand 1 
(melanoma growth 
stimulating activity, 
aloha) 


120 


452 


20 


2.00 




0.26 


OID 5350 


EST 


125 


457 


49 


2.08 


0.26 


0.28 


CD8A 


CD 8 antigen, alpha 
polypeptide (p32) 


83 


415 


57 


1.39 




0.28 


CD4 


CD4 antigen (p55) 


116 


448 


55 


1.64 




0.28 


PTGS2 


Prostaglandin- 
endoperoxide synthase 2 
(prostaglandin G/H 
synthase and 
cvclooxveenase) 


128 


460 


46 


2.05 


0.37 


0.29 


GZMB 


Granzyme B (granzyme 
2, cytotoxic T- 
lymphocyte-associated 
serine esterase 1) 


76 


408 


40 


1.81 




0.33 


CCL4 


Chemokine (C-C motif) 
lisand 4, SCYA4 


104 


436 


53 


2.25 




0.35 


ChGn 


Chondroitin beta 1 ,4 N- 

acetylgalactosaminyltran 

sferase 


106 


438 


31 


2.57 




0.36 


TCRGC2 


T cell receptor gamma 
constant 2 


126 


458 


52 


1.33 




0.39 


HLA-F 


Major histocompatibility 
complex, class I, F 


i i/i 
1Z4 


A 

4D0 






n 17 

U. i / 




TAP1 


Transporter 1, ATP- 
binding cassette, sub- 
familv B (MDR1/TAP) 


89 


421 


36 


1.93 




0.45 
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Gene 


Gene Name 


SEQ ID 
SOmer 


SEQID 
RNA/cDNA 


n 


Non- 
parametric 

* 

Odds ratio 


Fisher p- 
value 


t-test p- 
value 


BY55 


Natural killer cell 
receptor, 
immunoglobulin 
suoerfamilv member 


84 


416 


52 


2.49 


0.16 


0.48 


IL8 


Interleukin 8 


75 


407 


49 


2.10 


0.26 


0.49 
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Gene 


ACC 


SEQ ID 
50mer 


SEQ ID 
RNA/cDNA 


RefSeq Peptide 
Accession # 


SEQ ID Protein 


ACTB 


NM 001101 


2 


334 


NP 001092 


2400 | 


GUSB 


NM 000181 


3 


335 


NP_000172 


2401 ! 


B2M 


NM 004048 


4 


336 


NP 004039 


2402 


TSN 


NM 004622 


5 


337 


NP 004613 


2403 


CCR7 


NM 001838 


6 


338 


NP 001829 


2404 


IL1R2 


NM 004633 


7 


339 


NP 004624 


2405 


AIF-1 


NM 004847 


8 


340 


NP 004838 


2406 


ALAS2 


NM 000032.1 


9 


341 


NP 000023 


2407 


APELIN 


NM 017413 


10 


342 


NP 059109 


2408 


CD80 


NM 005191 


11 


343 


NP 005182 


2409 


EPB41 


NM 004437 


12 


344 


NP 004428 


2410 


CBLB 


NM 004351 


13 


345 


NP 733762 


2411 


CCR5 


NM 000579 


14 


346 


NP 000570 


2412 


MME 


NM 000902 


15 


347 


NP 000893 


2413 


KLRC1 


NM 002259 


16 


348 


NP 002250 


2414 


FCGR3A 


NM 000569 


17 


349 


NP 000560 


2415 


FCGR3B 


NM 000570 


18 


350 


NP 000561 


2416 


LAG3 


NM 002286 


19 


351 


NP 002277 


2417 


PEC AMI 


NM 000442 


20 


352 


NP 000433 


2418 


CD34 


NM 001773 


21 


353 


NP 001764 


2419 


FCGR1A 


NM 000566 


22 


354 


NP 000557 


2420 


TFRC 


NM 003234 


23 


355 


NP 003225 


2421 


CMA1 


NM 001836 


24 


356 


NP 001827 


2422 j 


KIT 


NM 000222 


25 


357 


NP 000213 


2423 


MPL 


NM 005373 


26 


358 


NP 005364 


2424 


EphB6 


NM 004445 


27 


359 


NP 004436 


2425 


EPO-R 


NM 000121.2 


28 


360 


NP 000112 


2426 


Foxp3 


NM 014009 


29 


361 


NP 054728 


2427 


GATA-1 


NM 002049 


30 


362 


NP 002040 


2428 


ITGA2B 


NM 000419 


31 


363 


NP 000410 


2429 


GNLY 


NM 006433 


32 


364 


NP 006424 


2430 | 


GZMA 


NM 006144 


33 


365 


NP 006135 


2431 


HBA 


NM 000558.3 


34 


366 


NP 000549 


2432 


HBZ 


NM 005332.2 


35 


367 


NP 005323 


2433 


HBD 


NM 000519.2 


37 


369 


NP 000510 


2434 


HBE 


NM 005330 


38 


370 


NP 005321 


2435 


HBG 


NM 000559.2 


39 


371 


NP 000550 


2436 


HBQ 


NM 005331 


40 


372 


NP 005322 


2437 


HLA-DP 


NM 033554 


41 


373 


NP 291032 


2438 


HLA-DQ 


NM 002122 


42 


374 


NP 002113 


2439 


ICOS 


NM 012092 


44 


376 


NP 036224 


2440 


IL18 


NM 001562 


45 


377 


NP 001553 


2441 


IL3 


NM 000588 


46 


378 


NP 000579 


2442 


ITGA4 


NM 000885 


47 


379 


NP 000876 


2443 


ITGAM 


NM 000632 


48 


380 


NP 000623 


2444 


ITGB7 


NM 000889 


49 


381 


NP 000880 


2445 


CEBPB 


NM 005194 


50 


382 


NP 005185 


2446 


NF-E2 


NM 006163 


51 


383 


NP 006154 


2447 


PDCD1 


NM 005018 


52 


384 


NP 005009 


2448 


PF4 


NM 002619 


53 


385 


NP 002610 


2449 


PRKCQ 


NM 006257.1 


54 


386 


NP 006248 


2450 


PPARGCl 


NM 013261 


55 


387 


NP 037393 


2451 


RAG1 


NM 000448 


56 


388 


NP 000439 


2452 


RAG2 


NM 000536 


57 


389 


NP 000527 


2453 


CXCL12 


NM 000609 


58 


390 


NP 000600 


2454 


TNFRSF4 


NM 003327 


59 


391 


NP 003318 


2455 
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TNFSF4 


NM 003326 


60 


392 


NP 003317 


2456 


TPS1 


NM 003293 


61 


393 


NP 003284 


2457 


ADA 


NM 000022 


62 I 


394 


NP 000013 


2458 


CPM 


NM 001874.1 


63 


395 


*v TTk AA1 O f & 

NP 001865 


2459 


CSF2 


NM 000758.2 


64 


396 


NP 000749 


2460 


CSF3 


NM 172219 


65 


397 


NP 757373 


2461 


CRP 


NM 000567.1 


66 


398 


NP 000558 


2462 


FLT3 


NM 004119 


67 


399 


NP 004110 


2463 


GATA3 


NM 002051.1 


68 


400 


NP 002042 


2464 


IL7R 


NM 002185.1 


69 


401 


NP 002176 


2465 


KLF1 


NM 006563.1 


70 


402 


NP 006554 


2466 


LCK 


NM 005356.2 


71 


403 


NP 005347 


2467 I 


LEF1 


NM 016269.2 


72 


404 


NP 057353 


2468 


PLAUR 


NM 002659.1 


73 


405 


NP 002650 


2469 


TNFSF13B 


NM 006573.3 


74 


406 


NP__006564 


2470 


IL8 


NM 000584 


75 


407 


NP 000575 


2471 | 


GZMB 


NM 004131 


76 


408 


NP 004122 


2472 


TNFSF6 


NM 000639 


77 


409 


NP 000630 


2473 


TCIRG1 


NM 006019 


78 


410 


NP 006010 


2474 


PRF1 


NM 005041 


79 


411 


NP 005032 


2475 


IL4 ] 


NM 000589 


80 


412 


NP 000580 


2476 


IL13 


NM 002188 


81 


413 


NP 002179 


2477 


CTLA4 


NM 005214 


82 


414 


NP 005205 ! 


2478 


CD8A 


NM 001768 


83 


415 


NP 001759 


2479 


BY55 


NM 007053 


84 


416 


NP 008984 


2480 


HBB 


NM 000518 


86 


418 


NP 000509 


2481 


BPGM 


NM 001724 


87 


419 


NP 001715 


2482 


MTHFD2 


NM 006636 


88 


420 


NP 006627 j 


2483 


TAP1 


NM 000593 


89 


421 


NP 000584 


2484 


OID 873 


AK000354 


94 


426 


NP 056212 


2485 


CXCR4 


NM 003467 


96 


428 


NP 003458 


2486 


CD69 


NM 001781 


97 


429 


NP 001772 


2487 


CCL5 


NM 002985 


98 


430 


NP 002976 


2488 


IL6 


NM 000600 


99 


431 


NP 000591 


2489 


IL2 


NM 000586 


100 


432 


NP 000577 


2490 


KLRF1 


NM_0 16523 


101 


433 


NP 057607 


2491 


LYN 


NM 002350 


102 


434 


NP 002341 i 


2492 


IL2RA 


NM 000417 


103 


435 


NP 000408 


2493 


CCL4 


NM 002984 


104 


436 


NP 002975 


2494 


ChGn 


NM 018371 


106 


438 


NP 060841 


2495 


CXCL9 


NM 002416 


108 


440 


NP 002407 


2496 


CXCL10 


NM 001565 


109 


441 


NP 001556 


2497 


IL17 


NM 002190 


110 


442 


NP 002181 


2498 


IL15 


NM 000585 


111 


443 


NP 000576 


2499 


IL10 


NM 000572 


112 


444 


NP 000563 j 


2500 


IFNG 


NM 000619 


113 


445 


NP 000610 ! 


2501 


HLA-DRB1 


NM 002124 


114 


446 


NP 002115 j 


2502 


CD8B1 


NM 004931 


115 


447 


NP 004922 


2503 


CD4 


NM_000616 


116 


448 


NP_000607 


2504 


CXCR3 


NM 001504 


117 


449 


NP 001495 


2505 


CXCL1 


X.TX A f\f\ 1 C 1 1 

NM 001511 


120 


452 


\m f\C\ 1 Cf\1 1 

NP 001 dUZ 




GPI 


NM 000175 


122 


454 


NP 000166 


2507 


CD47 


NM 001777 


123 


455 


NP 001768 


2508 


HLA-F 


NM 018950 


124 


456 


NP 061823 


2509 


PTGS2 


NM 000963 


128 


460 


NP 000954 


2510 


OID 5847 


NM 144588 


129 


461 


NP 653189 


2511 
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PRDM1 


NM 001198 


130 


462 


NP 001189 


2512 


CKB 


KM 001823 


131 


463 


NP 001814 


2513 


TNNI3 


NM 000363 


132 


464 


NP 000354 


2514 


TNNT2 


NM 000364 


133 


465 


NP 000355 


2515 ! 


MB 


NM 005368 


134 


466 


NP 005359 


2516 


SLC7A11 


NM 014331 


135 


467 


NP 055146 


2517 


TNFRSF5 


NM 001250 


136 


468 


NP 001241 


2518 


TNFRSF7 


NM 001242 


137 


469 


NP 001233 


2519 


CD86 


NM 175862 


138 


470 


NP 787058 


2520 


AIFlv2 


NM 004847 


139 


471 


NP 004838 


2521 


CMV !Ele3 


NC 001347, compl 


144 


476 


NP 040060 


2522 


CMV IEle4 


NC 001347, compl 


145 


477 


NP 040060 


2523 I 


EV EBNA-1 


NC 001345, 10795 


146 


478 


NP 039875 


2524 


EVBZLF-1 


NC 001345, compl 


147 


479 


NP 039871 


2525 


CMVIE1 


NC 001347, compl 


150 


482 


NP 040060 


2526 


CMV IE1 


NC 001347, compl 


151 


483 


NP 040060 


2527 


CLC 


NM 001828 


152 


484 


NP 001819 j 


2528 


TERF2IP 


NM 018975 


153 


485 


NP 061848 


2529 | 


HLA-A 


NM 002116 


154 


486 


NP 002107 


2530 


MSCP 1 


NM 018579 


156 


488 


NP 061049 


2531 


DUSP5 1 


NM 004419 


157 


489 


NP 004410 


2532 


PRO 1853 


NM 018607 


158 


490 


NP 061077 


2533 


CDSN 


NM 001264 


160 


492 1 


NP 001255 


2534 ! 


RPS25 


NM 001028 


162 


494 


NP 001019 


2535 


GAPD 


NM 002046 


163 


495 


NP 002037 


2536 


RPLP1 ! 


NM 001003 


164 


496 


NP 000994 


2537 


POLR2D 


NM 004805 


168 


500 


NP 004796 


2538 


ARPC3 


NM 005719 


169 


501 


NP 005710 


2539 


FPRL1 


NM 001462 


173 


505 


NP 001453 


2540 


FKBPL 


NM 022110 


174 


506 


NP 071393 


2541 


PREB 


NM 013388 


175 


507 


NP 037520 


2542 j 


OID 7595 


NM 015653 


177 


509 


NP 056468 


2543 


RNF19 


NM 015435 


178 


510 


NP 056250 


2544 


SMCY 


NM 004653 


179 


511 


NP 004644 


2545 


OID 7504 


NM 152312 


181 


513 


NP 689525 


2546 


DNAJC3 


NM 006260 


182 


514 


NP 006251 


2547 


ARHU 


NM 021205 


183 


515 


NP 067028 


2548 i 


OID 7200 


NM 022752 


184 


516 


NP 073589 


2549 J 


SERPINB2 


NM 002575 


185 


517 


NP 002566 


2550 


ENOl 


NM 001428 


186 


518 


NP 001419 


2551 | 


MOP3 


NM 018183 


193 


525 


NP 060653 


2552 


RPL18A 


NM 000980 


194 


526 


NP 000971 


2553 


INPP5A 


NM 005539 


195 


527 


NP 005530 


2554 


RPS29 


NM 001032 


197 


529 


NP 001023 


2555 


VNN2 


NM 004665 


200 


532 


NP 004656 


2556 [ 


OID 7703 


NM 014949 


201 


533 


NP 055764 


2557 


EEF1A1 


NM 001402 


205 


537 


NP 001393 


2558 


PA2G4 


NM 006191 


206 


538 


NP 006182 


2559 


GAPD 


NM 002046 


207 


539 


NP 002037 


2560 


CHD4 


NM 001273 


208 


540 


NP 001264 


2561 


OID 7951 


NM 016521 


209 


541 


NP 057605 




DAB1 


NM 021080 


210 


542 


NP 066566 


2563 


OID 3406 


NM 018986 


211 


543 


NP 061859 


2564 


S100A8 


NM 002964 


215 


547 


NP 002955 


2565 


ERCC5 


NM 000123 


217 


549 


NP 000114 


2566 


RPS27 


NM 001030 


218 


550 


NP 001021 


2567 _ 
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ACRC 


NM 052957 


219 


551 


NP 443189 


2568 


UBL1 


NM 003352 


226 


558 


NP 003343 


2569 


SH3BGRL3 


NM 031286 


229 


561 


NP 112576 


2570 


SEMA7A 


NM 003612 


230 


562 


NP 003603 


2571 


IL21 


NM 021803 


233 


565 


NP 068575 


2572 


HERC3 


NM 014606 


234 


566 


NP 055421 


2573 


Pll 


NM 006025 


236 


568 


NP 006016 


2574 


TIMM10 


NM 012456 


238 


570 


NP 036588 


2575 


EGLN1 


AJ3 10543 


239 


571 


NP 071334 


2576 


TBCC 


NM 003192 


240 


572 


NP 003183 


2577 


RNF3 


NM 006315 


241 


573 


NP 006306 


2578 


CCNDBP1 


NM 012142 


243 


575 


NP .036274 


2579 1 


SUV39H1 


NM 003173 


245 


577 


NP 003164 


2580 


HSPC048 


NM 014148 


246 


578 


NP 054867 


2581 


WARS 


NM 004184 


248 


580 


NP 004175 


2582 


SERPINB11 


NM 080475 


254 


586 


NP 536723 


2583 


RNF10 


NM 014868 


256 


588 


NP 055683 


2584 


MKRN1 


NM 013446 


257 


589 


NP 038474 


2585 | 


RPS16 


NM 001020 


258 


590 


NP 001011 


2586 


BAZ1A 


NM 013448 


259 


591 


NP 038476 


2587 


ATP5L 


NM 006476 


261 


593 


NP 006467 


2588 


NCBP2 


NM 007362 


264 


596 


NP 031388 


2589 


HZF12 


NM 033204 


266 


598 


NP 149981 


2590 J 


CCL3 


D90144 


267 


599 


NP 002974 


2591 


PRDX4 


NM 006406 


270 


602 


NP 006397 


2592 


BTK 


NM 000061 


271 


603 


NP 000052 


2593 


PGK1 


NM 000291 


273 


605 


NP 000282 


2594 


TNFRSF10A 


NM 003844 


274 


606 


NP 003835 


2595 


ADM 


NM 001124 


275 


607 


NP 001115 


2596 ! 


C20orf6 


NM 016649 


277 


609 


NP 057733 


2597 


OID 3226 


NM 015393 


278 


610 


NP 056208 


2598 


ASAH1 


NM 004315 


279 


611 


NP 004306 


2599 


ATF5 


NM 012068 


280 


612 


NP 036200 


2600 


OID 4887 


NM 032895 


281 


613 


NP 116284 


2601 


MDM2 


NM 002392 


283 


615 


NP 002383 


2602 


XRN2 


AF064257 


284 


616 


NP 036387 


2603 


PRKRA 


NM 003690 


288 


620 


NP 003681 


2604 


CDKN1B 


NM 004064 


293 


625 


NP 004055 


2605 


PLOD 


NM 000302 


294 


626 


NP 000293 


2606 


FZD4 


NM 012193 


297 


629 


NP 036325 


2607 


HLA-B 


NM 005514 


298 


630 


NP 005505 


2608 


FPR1 


NM 002029 


300 


632 


NP 002020 


2609 


ODF2 


NM 153437 


301 


633 


NP 702915 


2610 


IL17C 


NM 013278 


307 


639 


NP 037410 


2611 1 


VPS45A 


NM 007259 


310 


642 


NP 009190 


2612 


HADHB 


NM 000183 


315 


647 


NP 000174 


2613 


FLJ22757 


NM 024898 


316 


648 


NP 079174 


2614 


NMES1 


NM 032413 


319 


651 


NP 115789 


2615 


DMBT1 


NM 007329 


320 


652 


NP 015568 


2616 


RPS23 


NM 001025 


321 


653 


NP 001016 


2617 


ZF 


NM 021212 


322 


654 


NP Uo/U3j 


ZO 1 5 


NFE2L3 


NM 004289 


323 


655 


NP 004280 


2619 


RAD9 


NM 004584 


324 


656 


NP 004575 


2620 


DEFCAP 


NM 014922 


326 


658 


NP 055737 


2621 


IL22 


NM 020525 


328 


660 


NP 065386 


2622 


PSMA4 


NM 002789 


329 


661 


NP 002780 


2623 
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CCNI 


NM 006835 


330 


662 


NP 006826 


2624 


THBD 


NM 000361 | 


331 


663 


NP 000352 


2625 1 


CGR19 


NM 006568 I 


332 


664 


NP 006559 


2626 


HSRRN18S 


X03205 


1 


333 i 






HBB 


NG 000007 


36 


368 1 






HLA-DRB 




43 


375 






OID 4460 


AF 150295 


85 


417 






KPNA6 


AW021037 


90 


422 






OID 4365 


All 14652 


91 


423 






IGHM 


BC032249 


92 


424 






OID 573 


AB040919 


93 


425 






OID 3 


AW968823 


95 


427 






OID 6207 


D20522 


105 


437 






OID 4281 


AA053887 


107 


439 






OID 7094 




118 


450 






OID 7605 ! 


AA808018 


119 


451 






OID 253 


AK091125 


121 ! 


453 






OID 5350 ! 


AK055687 


125 


457 






TCRGC2 


M17323 


126 


458 






OID 7016 i 


BIO 18696 


127 


459 






EV EBV 




141 


473 






CMV p67 


NC 001347 


142 


474 






CMV TRL7 




143 


475 






EV EBN 




148 


480 






EV EBNA-LP 




149 


481 






OID 5891 


AW297949 


155 


487 






OID 6420 


AK090404 


159 


491 






OID 4269 


BM727677 


161 


493 






OID 5115 


AI364926 


165 


497 






SLC9A8 


AB023156 


166 


498 






OID 1512 


BE6 18004 


167 


499 






OID 6282 


BC041913 


170 


502 






PRO1073 


AF001542 


171 


503 






OID 7222 


BG260891 


172 


504 






OID 1551 


BE887646 


176 


508 






OID 4184 


XI 7403 


180 


512 






OID 7696 


AW297325 


187 


519 






OID 4173 


XI 7403 


188 


520 






CSF2RB 


AL540399 


189 


521 






OID 7410 


AW837717 


190 


522 






OID 4180 


X17403 


191 


523 






OID 5101 


BG461987 


192 


524 






MAN7 


BG772661 


196 


528 






OID 6008 


AW592876 


198 


530 






OID 4186 


XI 7403 


199 


531 






OID 7057 


480F8 


202 


534 






OID 4291 


BC038439 


203 


535 






OID 1366 


AW850041 


204 


536 






OID 6986 


AK093608 


212 


544 






OID 5962 


AW452467 


213 


545 






OID 5152 


AI392805 


214 


546 






HNRPU 


BM467823 


216 


548 






PSMD11 


AI684022 


220 


552 






OID 1016 


AK024456 


221 


553 






OID 1309 


AV706481 


222 


554 






OID 7582 


AK027866 


223 


555 







197 



WO 2004/042346 



PCT/US2003/012946 



Gene 


ACC 


SILQ ID 
SOmer 


SEQ ID 
RNA/cDNA 


Keii>eq repiicie 
Accession # 


htLKl iu Protein 


OID 4317 


AI3 18342 


224 


556 






OID 5889 


AW297843 


225 


557 






OID 3687 


W03955 


227 


559 






OID 7371 


BE730505 


228 


560 






ODD 5708 


AW081540 


231 


563 






OID 5992 


AW467992 


232 


564 






OID 7799 


AW837717 


235 


567 






OID 7766 


AW294711 


237 


569 






OID 6451 


AL834168 


242 


574 






OID 8063 


X68264 


244 


576 






OID 5625 


AW063780 


247 


579 






OID 6823 


AL832642 


249 


581 : 






OID 7073 


AL705961 


250 


582 






OID 5339 


AI625119 


251 


583 






OID 4263 


AA136584 


252 


584 






MGC26766 


AK025472 


253 


585 






OID 6711 


BF968628 


255 


587 






OID 5998 


AW468459 


260 


592 






OID 6393 


52B9 


262 


594 






RoXaN 


BC004857 


263 


595 






OID 6273 


AW294774 


265 


597 






OID 4323 


AA744774 


268 


600 






OID 5181 


AI400725 


269 


601 






OID 6298 


AI948513 


272 


604 






OID 357 


138G5 


276 


608 






OID 4239 


BQ022840 


282 


614 






OID 6039 


BE502246 


285 


617 






OID 4210 


AI300700 


286 


618 






OID 7698 


AA243283 


287 


619 






OID 4288 


AI378046 


289 


621 






OID 5620 


AW063678 


290 


622 






OID 7384 


BF475239 


291 


623 






OID 1209 


C14379 


292 


624 






OID 5128 


AK097845 


295 


627 






OID 5877 


AW297664 


296 


628 






OID 5624 


AW063921 


299 


631 






OID 5150 


AI392793 


302 


634 






OID 5639 


AW064243 


303 


635 






OID 6619 


469A10 


304 


636 






OID 6933 


AI089520 


305 


637 






OID 7049 


480E2 


306 


638 






OID 5866 


BM684739 


308 


640 






CD44 


AA9 16990 


309 


641 






OID 4932 


AA457757 


311 


643 






OID 7821 


AA743221 


312 


644 






OID 4916 


AA252909 


313 


645 






OID 4891 


AL832329 


314 


646 






RAC1 


AK054993 


317 


649 






OID 6415 


CA407201 


318 


650 






OID 6295 


AI880607 


325 


657 






RPL27A 


BF214146 


327 


659 
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Table 3: Viral genomes were used to design oligonucleotides for the microarrays. 
The accession numbers for the viral genomes used are given, along with the gene 
name and location of the region used for oligonucleotide design, 



Virus 


Gene Name 


Genome Location 




Ela 


1226.. 1542 




Elb_l 


3270... 3503 




E2a_2 


complement(24089..25885) 


Adenovirus, type 2 


E3-1 


27609..29792 


Accession #J01917 


E4 (last exon at 3' -end) 


complemem(33 1 93..32802) 




IX 


3576..4034 




lva2 


complement(408 1 - 54 1 7) 




DNA Polymerase 


complement^ 1 87. .541 8) 




HCMVTRL2 (IRL2) 


1893..2240 




HCMVTRL7 (IRL7) 


complement(6595.6843) 




HCMVUL21 


complement(26497..27024) 




HCMVTJL27 


complement(3283 1 ..34657) 




HCMVUL33 


43251-44423 




HCMVUL54 


complement(76903..8063 1 ) 


v,y Iomega loviru a 


HCMVUL75 


complement 1 0790 1 . . 1 1 0 1 32) 


(CMV) 


HCMVUL83 


complement(119352..l21037) 


Accession #X17403 


HCMVUL106 


complement( 1 54947. . 1 553 24) 




HCMVUL109 


complement 1 575 1 4. . 1 578 1 0) 




HCMVUL1 13 


161503..162800 j 




HCMVUL122 


complement(169364..170599) 




HCMVUL123 (last exon at 3'-end) 


complement(l 7 1 006. . 1 72225) 




HCMVUS28 


219200..220171 




Exon in EBNA-1 RNA 


67477..67649 




Exon in EBNA-1 RNA 


98364..98730 


Epstein-Bar r virus 


BRLFl 


complement 1 03366. . 1 05 1 83) 


(EBV) 


BZLF1 (first of 3 exons) 


complement 1 02655.. 1 03 1 55) 


Accession #NC_001 345 


BMLF1 


complement(82743..84059) 




BALF2 


complement 1 6 1 384..1 64770) 




U16/U17 


complement(26259..27349) 




U89 


complement 1 3309 1 .. 1 3561 0) 




U90 


complement 1 35664.. 1 35948) 




U86 


complement(125989..128136) 




U83 


123528. 123821 




U22 


complement(33739..34347) 


Human Herpesvirus 6 


DR2 (DR2L) 


791. .2653 


(HHV6) 


DR7 (DR7L) 


5629..6720 


Accession #NC__001664 


U95 


142941. .146306 




U94 


complemen t( 1 4 1 394. . 1 42866) 




U39 


complement(59588..62080) 




U42 


complement(69054..70598) 




U8I 


complement 121810-1 22577) 




U91 


136485.. 136829 
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Table 4: Dependent variables for discovery of gene expression markers of cardiac 
allograft rejection. 



Dependent 
Variable 


Description 


Number of 
Rejection 
Samples 


Number of 
No-Rejection 
Samples 


0 vs 1-4 Bx 


Grade 0 vs. Grades 1-4, local biopsy reading 


65 


114 


SU VS 1D-H XI VJ 


Cf a V»|p C\rr*Af* n vc r^rarlpQ 1 R~4 hioflPQt 
OlaDlC VJIaQC \J V5 VJldUCS lX>-*t, IligllCdl 

grade, Grade 1 A not included 


41 


57 


U-1A VS 1 nU 


VjIdQCb w dllU 1 r\ V5» VJlaUCa JJD-t, IllgllCbl 

grade. 


121 


58 


Ovs 3A HG 


Grade 0 vs Grade 3 A, highest grade. Grades 
1 A-2 and Grade 3B were not included. 


56 


29 


Ovs 1B-4 


Grade 0 vs Grades 1B-4, highest grade. Grade 
1A was not included. 


57 


57 l 


Ovs 1A-4 


Grade 0 vs. Grades 1-4, highest grade 


56 


123 
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Table 5: Real-time PCR assay chemistries. Various combinations of reporter and 
quencher dyes are useful for real-time PCR assays. 



Reporter 


Quencher 


FAM 


TAMRA 


BHQ1 


TET 


TAMRA 


BHQ1 


JOE 


TAMRA 


BHQ1 


HEX 


TAMRA 


BHQ1 


VIC 


TAMRA 


BHQ1 


ROX 


BHQ2 


TAMRA 


BHQ2 



201 



WO 2004/042346 
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Gene 
Array 
Probe 
SEQ 


Phase 1 


Phase 2 


All Data 


ID 


Fold 


t-Test 


KID 


K 


rOlu 


* Tact 

I- i esi 


KIR 


D 

r\ 


rOlu 


i- 1 esi 


KID 

nix 




95 


1.093 


0.36084 


a ft 

1U 


o 
8 










r\ no a 

0.935 


ft o a a a o 
U.Olo4o 


OA 


A O 
1 0 


111 


1.415 


0.0095 


12 


10 










1.415 


ft ftftftC 

0.0095 


A O 

12 


A ft 
1U 


79 


1.822 


0.01146 


6 


7 


0.63 


0.04185 


19 


15 


0.72 


ft rtCCOl 

0.05632 


35 


Oft 


3016 


1.045 


0.41017 


12 


10 










A ftft A 

1.001 


0.49647 


A C 

16 


A C 


75 


0.84 


0.36674 


11 


8 


0.595 


0.15788 


16 


13 


0.628 


0.08402 


34 


0£5 

26 


2765 


1.653 


0.01508 


10 


10 


0.776 


0.11082 


19 


14 


0.956 


0.37421 


38 


Oft 

29 


97 










0.75 


0.26201 


8 


8 


0.543 


0.1 1489 


17 


4 O 

12 


2635 


1.553 


0.00533 


13 


10 


0.834 


0.16853 


18 


15 


ft ftOO 

0.988 


ft ACACXA 

0.46191 


3b 


0"7 
Zf 


96 


1.495 


0.06288 


13 


9 


1.157 


0.27601 


18 


15 


1.155 


l> A4 ftftO 

0.21095 


OO 
OO 


OR 

ZO 


100 


1.43 


0.166 


10 


5 










A A ftO 

1 .408 


ft A A AAQ 

0. 1441o 


A O 
1Z 


o 
o 


2766 


0.956 


0.43918 


12 


10 


0.989 


0.48275 


A ft 

19 


14 


ft ft "TO 

0.978 


ft ACAf\A 

0.45101 


O A 
Ol 


0>! 
Z4 


2726 


1.037 


0.38205 


11 


9 










1.037 


ft OOOftC 

0.38205 


A A 
1 1 


ft 

y 


2768 


1.211 


0.02386 


9 


9 










A 04 A 
l.Z 1 1 


U.UZooo 


ft 

y 


y 


94 


1.601 


0.02418 


11 


10 










A OO A 
1 .831 


ft ftftftft>i 
0.uuuy4 


A7 


1 ft 
1 o 


2769 


1.133 


0.23094 


12 


9 


1.081 


ft A ftCOO 

0.19632 


A ft 

19 


A t~ 

15 


1. lOl 


ft A CAOO 


0«1 
Ol 


OA 
Z4 


2770 


1.734 


0.00017 


A O 

13 


A ft 

10 










A QQ1 


n f\A ooo 
U.U1 0Z0 


on 
zu 


1 R 


! 2647 


1.557 


0.04502 


10 


8 










A CC7 

1.557 


ft f\A CftO 

U.U40UZ 


1U 


Q 

O 


\ 2771 


1.99 


0.05574 


13 


9 










A CO 


A A A A A.Q 


A 7 


I o 


82 


2.029 


0.00022 


8 


5 


1.287 


ft A OftOO 

0.13022 


A O 

18 


A A 

14 


A OCC 

1.25b 


ft ncocc 
0.0535b 


OO 
OO 


OO 


83 


1.546 


0.05865 


13 


10 


0.577 


0.03934 


A Q 

18 


14 


ft ~rr\ c 

0.795 


ft 4 4 ftftO 

0.1 1993 


OQ 

oy 


Oft 


98 










0.716 


0.13 


19 


15 


0.577 


ft ftOOCO 

0.03352 


* ft 

19 


14 


36 


1.605 


0.09781 


12 


8 


2.618 


0.01227 


18 


1 1 


2.808 


ft ftftft4 C 

0.00015 


OO 

3o 


OO 

zo 


80 


5.395 


0.00049 


9 


6 


4.404 


0.05464 


10 


10 


2.33 


ft rtOOCrt 

0.02369 


Oft 

29 


1o 


89 


















0.295 


ft noocc 

0.02856 


6 


D 


77 


1.894 


0.01602 


10 


10 


0.537 


0.01516 


A ft 

19 


15 


ft oco 

0.863 


ft OAQQ7 

o.ziyo/ 


OK 
OO 


OQ 

zy 


2772 


1.583 


0.06276 


10 


6 


0.714 


4\ At At t*\ 

0.13019 


13 


10 


1.136 


ft OOO A A 

0.28841 


OO 

zo 


A 7 

17 


2773 


1.391 


0.09236 


11 


6 










A O ft A 

1.391 


ft noooc 


A A 


D 


2774 


1.59 


0.00022 


13 


10 










A Cft 

1 .59 


ft ftftftOO 


A O 
IO 


A ft 


102 


1.245 


0.05079 


11 


10 


1.018 


0.42702 


17 


15 


1.117 


ft AOOOO 

0.08232 


OO 

32 


OO 

Zo 


2775 


0.719 


0.16243 


11 


9 










ft ^ A ft 

0.719 


ft <(COilQ 

U. ioZ4o 


A A 


ft 

y 


2776 


1.257 


0.0516 


12 


9 










1 .257 


U.UOID 


A O 
1Z 


y 


2667 


1.343 


0.03806 


13 


9 










1.13 


ft -4 COCO 

0.15962 


Oft 

ZU 


4 o 


115 


1.199 


0.26299 


11 


9 










1.199 


ft oconn 
0.26299 


1 1 


y 


2669 


*> 14R 
i*tu 


ViUUO 1 O 


1° 


10 










1.296 


0.14285 


18 


12 


2777 


1.142 


0.20245 


13 


10 










1.142 


0.20245 


13 


10 


78 


1.324 


0.01985 


12 


9 


0.967 


0.33851 


18 


14 


1.007 


0.46864 


I 38 


24 


2670 


1.388 


0.11209 


13 


9 










1.388 


0.11209 


13 


9 


I 88 


1.282 


0.14267 


7 


7 


0.995 


0.48504 


17 


14 


1.008 


0.47383 


30 


23 


2778 


1.128 


0.19528 


13 


9 










1.128 


0.19528 


13 


9 


2779 


1.991 


0.02513 


9 


5 


0.642 


0.05002 


18 


14 


0.868 


0.26275 


32 


21 


2780 


1.597 


0.00355 


13 


10 


0.802 


0.11649 


17 


14 


1.013 


0.45521 


38 


26 


2781 










0.492 


0.01344 


12 


12 


0.819 


0.25555 


17 


15 
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Gene 
Array 
Probe 


Phase 1 | 


Phase 2 




All Data 






SEQ 
ID 


Fold 


t-Test 


NR 


R 


Fold 


t-Test 


NR 


R 


Fold 


t-Test 


NR 


R 


101 










0.652 


0.04317 


19 


15 


0.773 


0.09274 


29 


22 


106 


1.234 


0.19141 


13 


8 










1.234 


0.19141 


13 


8 


2683 


1.598 


0.03723 


8 


8 


0.633 


0.03893 


14 


10 


0.86 


0.18731 


28 


22 


2782 


1.213 


0.03305 


12 


10 


0.912 


0.07465 


19 


15 


0.969 


0.31955 


39 


27 


87 










4.947 


0.02192 


18 


15 


3.857 


0.00389 


30 


23 


j 99 


0.639 


0.06613 


7 


5 


0.839 


0.30304 


16 


8 


0.694 


0.04347 


27 


15 


2692 


0.801 


0.21236 


12 


8 


0.893 


0.33801 


18 


15 


0.782 


0.06938 


38 


25 


104 


2.292 


0.0024 


11 


8 


0.621 


0.05152 


19 


15 


0.913 


0.34506 


30 


23 


I 76 


1.809 


0.00893 


9 


8 


0.693 


0.13027 


13 


8 


1.274 


0.11887 


28 


19 


91 


1.969 


0.07789 


11 


8 


4.047 


0.00812 


19 


13 


3.535 


0.00033 


37 


23 


92 


2.859 


0.05985 


11 


8 


9.783 


0.03047 


18 


14 


8.588 


0.00192 


37 


24 


85 


0.95 


0.43363 


12 


8 


0.699 


0.0787 


13 


13 


0.633 


0.01486 


33 


24 


126 


1.76 


0.02199 


11 


10 










1.76 


0.02199 


11 


10 


2783 


0.945 


0.46023 


10 


5 


0.852 


0.26701 


17 


10 


0.986 


0.48609 


29 


17 


2707 


1.055 


0.31435 


13 


10 










1.055 


0.31435 


13 


10 


123 


1.154 


0.11677 


11 


10 










1.154 


0.11677 


11 


10 


84 


1.786 


0.00255 


9 


6 


0.523 


0.04965 


18 


14 


0.785 


0.14976 


34 


22 


2784 


2.12 


0.00022 


12 


10 


0.498 


0.01324 


18 


13 


0.935 


0.37356 


37 


25 


2785 


1.181 


0.1377 


10 


10 










1.181 


0.1377 


10 


10 


124 


1.353 


0.08122 


11 


9 










1.353 


0.08122 


11 


9 


90 


1.355 


0.02288 


13 


10 


0.973 


0.39248 


15 


13 


1.125 


0.08671 


28 


23 


2786 


1.306 


0.0773 


12 


10 










1.306 


0.0773 


12 


10 


2787 


1.086 


0.32378 


12 


10 










1.086 


0.32378 


12 


10 


3018 


1.523 


0.1487 


12 


10 


0.84 


0.27108 


18 


13 


1.101 


0.33276 


36 


26 


125 


1.252 


0.05782 


11 


10 










1.252 


0.05782 


11 


10 


2788 


1.255 


0.1221 


11 


10 










1.255 


0.1221 


11 


10 


2789 


1.152 


0.31252 


9 


6 










1.152 


0.31252 


9 


6 


3019 


1.268 


0.21268 


6 


7 


0.981 


0.45897 


16 


10 


1.012 


0.46612 


29 


19 


2790 


0.881 


A 4 77CC 

0.177OO 


1 1 


Q 

o 






1 o 


10 




0 33826 


40 


23 


! 2791 


1.837 


0.00553 


13 


10 










1.837 


0.00553 


13 


10 


3020 


1.271 


0.10162 


12 


10 


0.853 


0.10567 


19 


13 


0.965 


0.36499 


36 


25 


2792 


1.504 


0.05096 


12 


10 


0.713 


0.02979 


19 


15 


0.846 


0.16914 


OH 
O I 


£.0 


2793 


1.335 


0.03133 


12 


10 


0.883 


0.18577 


19 


15 


0.916 


0.23865 


36 


27 


2794 


1.936 


0.00176 


13 


9 


0.717 


0.09799 


19 


14 


0.877 


0.22295 


40 


25 


2752 


1.499 


0.03077 


12 


8 


0.808 


0.15363 


17 


13 


1.004 


0.48903 


36 


23 


2795 


0.815 


0.24734 


8 


5 


0.965 


0.41772 


19 


15 


0.938 


0.3265 


32 


22 


119 


1.272 


0.20279 


10 


10 










1.272 


0.20279 


10 


10 



203 



WO 2004/042346 



PCT/US2003/012946 



Table 7: Significance analysis for microarrays for identification of markers of acute 
rejection. In each case the highest grade from the 3 pathologists was taken for 
analysis. No rejection and rejection classes are defined. Samples are either used 
regardless of redundancy with respect to patients or a requirement is made that only 
one sample is used per patient or per patient per class. The number of samples used in 
the analysis is given and the lowest FDR achieved is noted. 



No Rejection 


Rejection 


# Samples 


Low FDR 


All Samples 


Grade 0 


Grade 3A-4 


148 


1 


Grade 0 


Grade 1B f 3A-4 


158 


1.5 


Non-redundant within class 


Grade 0 


Grade 3A-4 


86 


7 


Grade 0 


Grade 1B, 3A-4 


93 


16 


Non-redundant (1 sample/patient) 


Grade 0 


Grade 3A-4 


73 


11 
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Table 8: Renal rejection tissue gene expression SAM analysis 



Array 
probe ID 


Gene 


FDR 


Protein 
SEQ ID 


Leukocyte 
expression 


Secreted 


1 2697 


CD69 antigen (p60. early T-cell activat 


1 .5625 


2925 


+ 




| 2645 


Ras association (RalGDS/AF-6) 


1.5625 


2926 






I 2707 


CD33 antigen (gp67) (CD33), mRNA 


1 .5625 


2927 


+ 




2679 


Ras association (RalGDS/AF-6) domain fa 


1.5625 


2928 






| 2717 


EST, 5 end 


1.5625 








) 2646 


mRNA for KIAA0209 qene, partial cds /cd 


1 .5625 


2929 






2667 


leupaxin (LPXN). mRNA /cds=(93.1 253) 


1.5625 


2930 


+ 




2706 


c- EST 3 end /clone=IMAGE: 


2.1111 








2740 


c- insulin induced gene 1 (INSIG1), mRNA 


2.2 








| 117 


chemokine (C-X-C motif) receptor 3 


2.8125 


2931 






! 2669 


IL2-inducible T-cell kinase (ITK), mRNA 


2.8125 


2932 


+ 




2674 


glioma pathogenesis-related protein (RT 


2.8125 


2933 






2743 


c- nuclear receptor subfamily 1 , group I 


2.8125 








! 326 


death effector filament-forming Ced-4-l 


2.8125 


2934 






2716 


ESTcDNA. 3 end 


2.8125 








2727 


c- chemokine (C-X-C motif), receptor 4 


3.1316 


2935 


+ 




2721 


c- EST 3 end /clone=IMAGE: 


3.1316 








2641 


hypothetical protein FLJ20647 (FLJ20647 


3.1316 


2936 






2671 


tumor necrosis factor, alpha-induced pr 


3.525 


2937 






2752 


protein tyrosine phosphatase, receptor 


3.8077 


2938 


+ 




2737 


7f37g03.x1 cDNA, 3 end /clone=IMAGE: 


3.8077 








2719 


c- EST372075 cDNA 


3.8077 








2684 


molecule possessing ankyrin repeats ind 


3.8077 


2939 






76 


granzyme B (granzyme 2, cytotoxic T-lym 


3.8077 


2940 


+ 


+ 


2677 


lectin-like NK cell receptor (LLT1), mR 


3.8077 


2941 


+ 




2748 


C-107G11 


3.9 








2703 


c- EST, 5 end /clone=IMAGE 


3.9 








2711 


SAM domain, SH3 domain and nuclear 


3.9 


2942 






2663 


phosphodiesterase 4B, cAMP-specific 


3.9 


2943 




+ 


98 


small inducible cytokine A5 (RANTES) 


4.5645 


2944 


+ 


+ 


2657 


tumor necrosis factor receptor superfam 


4.8286 


2945 






j 2683 


B-cell lymphoma/leukaemia 11B (BCL11B) 


4.8286 


2946 


+ 




2686 


phospholipase A2, group VII (platelet-a 


4.8286 


2947 




+ 


2687 


phosphatidylinositol 3-kinase catalytic 


4.8286 


2948 






2644 


AV659177 cDNA. 3 end 


4.9028 








2664 


regulator of G-protein signalling 10 (R 


5.0238 


2949 






2747 


c- integral membrane protein 2A (ITM2A), 


5.0238 


2950 






2744 


c- interferon consensus sequence binding 


5.0238 








2678 


HSPC022 protein (HSPC022). mRNA 


5.0238 


2951 






2731 


c-xi98c03.x1 NCI CGAP Co18cDNA 


5.0238 








2713 


caspase recruitment domain protein 9 (L 


5.0238 


2952 






2736 


c- small inducible cytokine A4 (homologo 


5.1395 


2953 


+ 


+ 


2708 


major histocompatibility complex, class 


5.15 


[ 2954 






249 


C-107H8 


5.15 








2670 


CD72 antigen (CD72), mRNA 


5.15 


2955 


+ 




2661 


heat shock 70kD protein 6 (HSP70B ) 


5.15 


2956 






2680 


bridging integrator 2 (BIN2), mRNA /cds 


5.15 


2957 






2754 


UI-H-BWO-aiy-b-10-0-Ul.s1 cDNA,3 end 


5.15 








| 2728 


c- EST380762 cDNA 


5.15 








i 174 


FKBPL 


5.15 


2958 






2742 


c- chromobox homolog 3 (DM) 


5.15 








2668 


basement membrane-induced gene(ICB-l) 


5.15 


2959 






2750 


Lysosomal-assoc. multispanning memb 


5.15 


2960 
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mi I eiy 
nrnhp ID 


VJCI IC 


FDR 


Protein 

• ■ vy m-w 111 

SEQ ID 


Leukocyte 
expression 


Secreted 


97AR 

Z f HO 


174H1 

1 / HL/ 1 


5.15 








97^A 
Z r JO 


r- AV71RR97 rDNA 5 pnd 


5.15 








9R97 
ZOZ / 


colnto warrior familw 17 f^nriiiim nhosnh 


5.15 


2961 






Zf 


r ocnaraninv/l-tRM A ^vnthptfl^P INARS) 
L» aopaiayiliyi irviNA^ oyiiu iciooc ^nr\i 


5.15 








1 OA 


major nisiuouinpaiiijiiiiy out 1 i pic a , waoo 


5.15 


2962 






ZD**/ 


rnrviNAA for i -ecu opcunii,* piuiciii /oua 


5.15 


2963 






zoZo 


~ COT Q Anrl 

c-to i , o ena 


5.2295 








ZOOO 


express cuiNrA nor ar y ul/in/a o 


5 2903 








Zf ZD 


Rm*V71R7QP1 rnMA R pnrl 
C- OU ID' lOr 57i • uUINrx, O cl 1U 


5 3385 


2964 






Zf l*f 


nn7flrn(\ v1 rHM A *^ onH /plnrlP 

qy/ocuo.xi cuinm, o ciiu /biunc 


5 3385 


2965 






ococ 
ZDOD 


inierieuKin z recepior ycJinma uudin 


5 3385 


2966 


+ 




zYol 


fZo4 f lecun, yaiaciosioe-Dinauiy, s>uiuuic 


S 4167 


2Q67 




+ 


zozy 


o «nMA- CI !91^RQ fie HnnP CLO\ 0R40R 
O, CUINM. rLJZ I DOi7 Mo, UlUllc uULUOtuO 


5 5299 


2968 








mDMA- r-nMA H K P'Znd.'^ PO ^ 1 fi 


5 5588 


2969 






07>| i 

z# *n 


hovnl/inaco 9 f 1—1 1^9 ^ mRMA 

C- nexoKinase z ^nr\zj, ■ t 1 iain/a 


5 5986 ! 








Hi 


oirniiar 10 major ni©iuuuiiipaiiLiiiii.y aniiycn 


5 5986 


2970 








f^n^ antint&n /nRR-R9^ /f^DS^ 


5 5986 


2971 






979R 

z/ zo 


r» Rn9R^n^70T1 rDNA ^ 
C" OOZOOUO f w 1 1 OL/ INrA, O 


5.6014 








0700 

Zf zz 


C- CO 1 CU1NM OlUIlt? 


5 6014 








zooy 


inieneuKin-z r ecepiur 


5.6014 


2972 






Z f o*t 


^ ni irlaar roronfnr ci ihfamilv/ 1 firni in 1 
C llUUIcal IcUcpiUI oUUialllliy 1, yiuuj-F i . 


5.6667 








ZOO I 


prs~D~ccii woiuiiy ~t?iiiiaiii*iiiy • av^ivji 


5.7566 


2973 






ZDOD 


nnctmointir' q on r^n potion inprp?l<*P(i 


5.7756 


2974 






9RQR 

zoyo 


piUlv>iii lyiudiiic? pnudpi laiaow, i cucpiui 


5.7756 


2975 






9A7R 
ZD ( D 


Hi iK/mnhilin ci iKfsimilw *^ mpmhpr A9 
DUiyropi 1 1 1 1 1 1 , oUUi al I iiiy o, iiiciiiuci 


5.8165 


2976 






97H1 
Z f U I 


C" CO 1 O CIIU 


5.9048 








Zf OU 


co i o enu /uioiits - iiviaavjc 


5.9048 








97 in 

Zf IU 


hinh offln immi inr>nlnhi ilin pn^ilnn rpppnt 


5.9048 


2977 






9R**9 
ZDOZ 


cnLrUUiny niajv/i iiioiov*uiiipauLiiiiiy t»ui i ipic 


5.9048 


2978 






Zf Z*» 


c- Co 1 o ena 


R Q04R 








Z0U0 


COT 
CO 1 










ZooZ 


inierreron reyuiaxory lacior i ^ir\r i ^, 


R OQRft 


2979 






ioy 


alloy ran iniiarnmaiory laoior i ^Mir i 


fi 1^7Q 


2980 






Z f Dj 


nlotolof or*ti\/otir»n rorpntnr hnmolnn > 
piaiclcl aullVaiiny icucpiui iiuiiiuiuy \n 


6 31 82 


2981 






97HA 
Z f U*f 


r- P^T ^ pnH /Hnnp=IMAGP' 

L» CO I O ClIU /OHJl IC IIVIAAVjl^. 


7.0337 








ZOf O 


pim"Z onuoyciic \n iivijcj, iur\i>iAA 


7 1222 


2982 




+ 


97HH 
Z f UU 


pr ULcuyiyoai i i , otswtsioiy yiaiiuic \* rvv_> i 


7.375 


2983 




+ 


I 9 RAH 


mRMA fnr KIAAOR7H nrnfpin nartial rri<% 


7.375 


2984 






979** 
Z / Zo 


r- P^T ^ pnH /rlonp=IMA(^P 

O CO 1 , O Cl l(J /OlOI IC — IIVIAAVJ t 


7.375 








zooo 


FYM hinHinn nrnfpin /FYR-1 90/1 *\C)\ (FYfK} 
r T iN-oinaiiiy pioicii i \t • d i ^u/ i ouy \i • 


7 375 


2985 






9RRR 
ZOOO 


mainr KictoorkmnAtihilitv/ rnmnlpy place 


7 375 


2986 






Z r OJ 


r*- P<5T ^ PnH /rlnnpsl MAGE' 


7.375 








97H9 
Z f UZ 


c~ nypoinciioai pioicni ivix3o*r/w» 


7.634 








ZOO 1 


hwrsnthotir>c»l nrntoin PI HORR9 

nypoincULfai pioiciii ■ lj i uuj^ 


8.11 17 


2987 






Zf DO 


COT ^ onH 

co i , o ena 


8.1117 








2715 


hypothetical protein FLJ 10842 


8.1117 








2732 


c-ESTcDNA,3 end 


8.1117 








2652 


hexokinase 2 (HK2) t mRNA 


8.1117 








2651 


colony stimulating factor 3 receptor 


8.1117 


2988 






2718 


RNA binding motif protein, X chrom 


8.2788 








2673 


Src-like-adapter (SLA). mRNA 


8.3048 


2989 






2733 


c- major histocompatibility complex 


8.467 








2712 


histamine receptor H2 (HRH2) 


8.8583 


2990 
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Array 
probe ID 


Gene 


FDR 


Protein 
SEQ ID 


Leukocyte 
expression 


Secreted 


2659 


hemopoietic cell kinase (HCK) 


8.8583 


2991 






2654 


xanthene dehydrogenase (XDH) 


8.8583 


2992 






2636 


Arabidopsis root cap 1 


8.8583 


2993 






2639 


fatty acid binding protein 1 , liver 


8.8583 








2690 


adenosine deaminase (ADA) 


8.8583 | 


2994 






2705 


c- EST, 3 end 


8.8583 


2995 






2685 


hypothetical protein MGC10823 


8.8583 


2996 






2692 


membrane-spanning 4-domains, 


8.8583 


2997 






2693 


rearranged immunoglobulin mRNA for mu 


8.8583 






+ 


2648 


protein tyrosine kinase related mRNA 


8.8583 








2650 


major histocompatibility complex, class 


8.8583 


2998 






2720 


c- EST 3 end /clone=IMAGE: 


8.8583 






t 


2660 


major histocompatibility complex, class 


8.8583 


2999 






2666 


BCL2-related protein A1 (BCL2A1), mRNA 


9.1446 


3000 






2699 


c-EST 


9.4767 








i 2633 


interleukin 4 receptor 


9.4767 


3001 






! 74 


tumor necrosis factor (ligand) superfam 


9.4767 


3002 






1 2672 


interferon-induced, hepatitis C-assoc. 


9.4767 


3003 






2642 


cDNA FLJ20673 fis, clone KAIA4464 


9.4767 


3004 






2682 


VNN3 protein (HSA238982), mRNA 


9.4767 


3005 






2655 


cathepsin K (pycnodysostosis) (CTSK) 


9.4767 


3006 






2630 


Integrin, alpha L (CD11A (p180), lymphoc 


9.4767 


3007 






2745 


EST. 5 end 


9.4885 


3008 






2643 


nuclear receptor subfamily 1, group I, 


9.625 








2694 


CDW52 antigen (CAMPATH-1 ) 


9.625 


3009 






2749 


6977.C-178F5 


9.6903 


3010 






2665 


small inducible cytokine subfamily A 


9.6903 


3011 






2649 


signal transducer and activator 


9.7878 


3012 






2637 


324, 


9.7878 








2634 


70 activation (Act-2) mRNA 


9.7878 


3013 






2709 


coagulation factor VII 


9.7878 


3014 






2653 


integrin, beta 2 (antigen CD18 (p95) 


9.7878 


3015 






2729 


EST 3* end 


9.8321 
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Array 
Probe 
SEQ 
ID 


Gene 


Gene Name 


raRNA 
Accession # 


RefSeq 
Peptide 
Accession # 


Current 
UniGene 
Cluster 
(Build 156) 


Localization 


Function 


111 


IL15 


Intcrleukin 15 


NM_OO0585 


NP_000576 


Hs.168132 


Secreted 


T-cell activation and 
proliferation 


79 


PRFI 


Perforin 1 (pore 
forming protein) 


NM_005041 


NP_005032 


Hs.2200 


Secreted 


CD8, CTL effector; 
channel-forming protein 
capable of lysing non- 
specifically a variety of 
target cells; clearance of 
virally infected host cells 
and tumor cells; . 


110 


IL17 


Interleukin 17 
(cytotoxic T- 
lymphocyte- 
associated serine 
esterase 8) 


NM_002190 


NP_002181 


Hs.41724 


Secreted 


Induces stromal cells to 
produce 

proinflammatory and 
hematopoietic cytokines; 
enhances IL6, IL8 and 
ICAM-I expression in 
fibroblasts; osteoclastic 
bone resorption in RA; 
expressed in only in 
activated CD4+T cells 


75 


IL8 


Interleukin 8 


NM_000584 


NP_000575 


Hs.624 


Secreted 


Proinflammatory 
cytokine 


120 


CXCL1 


Chemokine (C-X- 
C motif) ligand 1 
(melanoma growth 
stimulating 
activity, alpha) 


NMJKH5U 


NP_001502 


Hs.789 


Secreted 


Neurogenesis, immune 
system development, 
signaling 


113 


IFNG 


Interferon, gamma 


NM_000619 


NPJ)00610 


Hs.856 


Secreted 


Antiviral defense and 
immune activation 


100 


IL2 


Interleukin 2 


NM_000586 


NP_000577 


Hs.89679 


Secreted 


Promotes growth of B 
and T cells 


4 


B2M 


beta 2 

microglobulin 


NM_Q04048 


NP_004039 


Hs.75415 


Secreted 




98 


CCL5 


Chemokine (C-C 
motif) ligand 5 
(RANTES, 
SCYA5) 


NM_002985 


NP_002976 


Hs.241392 


Secreted 


Chemoattractant for 
monocytes, memory T 
helper cells and 
eosinophils; causes 
release of histamine 
from basophils and 
activates eosinophils; 
One of the major HIV- 
suppressive factors 
produced by CD8+ cells 


112 


ILIO 


Interleukin 10 


NM_000572 


NP_000563 


Hs. 1937 17 


Secreted 


Chemotactic factor for 
CD8+T cells; down- 
regulates expression of 
Thl cytokines, MHC 
class II Ags, and 
costimulatory molecules 
on macrophages; 
enhances B cell survival, 
proliferation, and 
antibody production; 
blocks NF kappa B, 
JAK-STAT regulation; 


80 


IL4 


Interleukin 4 


NM_000589 


NP_000580 


Hs.73917 


Secreted 


TH2, cytokine, 
stimulates CTL 


2773 


IL7 


Interleukin 7 


NMJX)0880 


NPJ)00871 


Hs.72927 


Secreted 


Proliferation of 
lymphoid progenitors 



208 



WO 2004/042346 
Table 9 



PCT/US2003/012946 



Array 
Probe 
SEQ 
ID 


Gene 


Gene Name 


mRNA 
Accession # 


RefSeq 
Peptide 
Accession # 


Current 
UniGene 
Cluster 
(Build 156) 


Localization 


Function 


109 


CXCLIO 


Chemokine (C-X- 
C motif) ligand 10, 
SCYB10 


NM_00I565 


NP_00I556 


Hs.2248 


Secreted 


Stimulation of 
monocytes; NK and T 
cell migration, 
modulation of adhesion 
molecule expression 


2665 


CCLI7 


Chemokine (C-C 
motif) ligand 1 7 


NM_002987 


NP_002978 


Hs.66742 


Secreted 


T cell development, 
trafficking and activation 


101 


KLRF1 


Killer cell lectin- 
like receptor 
subfamily F, 
member 1 


NM_0 16523 


NP_057607 


Hs. 183 125 


Secreted 


Induction of IgE, lgG4, 
CD23, CD72, surface 
IgM, and class II MHC 
antigen in B cells 


99 


IL6 


Interleukin 6 


NM_000600 


NP_000591 


Hs.93913 


Secreted 


B cell maturation 


104 


CCL4 


Chemokine (C-C 
motif) ligand 4 


NM_002984 


NP_002975 


Hs.75703 


Secreted 


Inflammatory and 
chemokinetic properties; 
one of the major H1V- 
suppressive factors 
produced by CD8+ T 
cells 


76 


GZMB 


Granzyme B 
(granzyme 2, 
cytotoxic T- 
lymphocyte- 
associated serine 
esterase 1) 


NM_004I31 


NP_004122 


Hs.1051 


Secreted 


Apoptosis; CD8, CTL 
effector 


2785 


OID_4789 


KIAA0963 protein 


NM_014963 


NP_055778 


Hs.7724 


Secreted 


Proinflammatory; 
chemoattraction and 
activation of neutrophils 


2791 


XCL1 


Chemokine (C 
motif) ligand 1 
(SCYC2) 


NM_002995 


NP_002986 


Hs.3195 


Secreted 


Chemotactic factor for 
lymphocytes but not 
monocytes or neutrophils 


130 


PRDM1 


PR domain 
containing 1 , with 
ZNF domain 


NM_00U98 


NP_00U89 


Hs.388346 


Nuclear 


Transcription factor; 
promotes B cell 
maturation, represses 
human beta-IFN gene 
expression 


2781 


TBX2I 


T-box21 


NM_013351 


NPJ>37483 


Hs.272409 


Nuclear 


THl differentiation, 
transcription factor 


88 


MTHFD2 


Methylene 

tetrahydrofolate 

dehvdrocenase 

(NAD+ 

dependent), 

methenyltetrahydr 

ofolate 

cyclohydrolase 


NM_006636 


NPJ306627 


Hs. 154672 


Mitochondrial 


Folate metabolism 


103 


IL2RA 


Interleukin 2 
receptor, alpha 


NMJ)00417 


NP_000408 


Hs.1724 


Membrane- 
bound and 
soluble forms 


T cell mediated immune 
response 


77 


TNFSF6 


Tumor necrosis 
factor (ligand) 
super family, 
member 6 


NM_000639 


NP_000630 


Hs.2007 


Membrane- 
bound and 
soluble forms 


CD8, CTL effector; 
proapoptotic 


115 


CD8B1 


CD8 antigen, beta 
polypeptide 1 
(P37) 


NM_004931 


NPJ)04922 


Hs.2299 


Membrane- 
bound and 
soluble forms 


CTL mediated killing 


128 


PTGS2 


Prostaglandin- 
endoperoxide 
synthase 2 
(prostaglandin 
G/H synthase and 
cyclooxygenase) 


NM_000963 


NP_000954 


Hs. 196384 


Membrane- 
associated 


Angiogenesis, cell 
migration, synthesis of 
inflammatory 
prostaglandins 
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A rray 
Probe 
SEQ 
ID 


Gene 


Gene Name 


mRNA 
Accession # 


RefSeq 
Peptide 
Accession # 


Current 
UniGene 
Cluster 
(Build 156) 


Localization 


Function 


89 


TAP I 


Transporter 1 , 
ATP-binding 
cassette, sub- 
family B 
(MDRI/TAP) 


NM_000593 


NP_000584 


Hs.352018 


ER membrane 


Transports antigens into 
ER for association with 
MHC class I molecules 


92 


IGHM 


Immunoglobulin 
heavy constant mu 


BC032249 




Hs.300697 


Cytoplasmic 
and secreted 
forms 


Antibody subunit 


122 


GPI 


Glucose phosphate 
isomerase 


NM_000175 


NPJ)00J66 


Hs.409162 


Cytoplasmic 
and secreted 
forms 


Glycolysis and 
gluconeogenesis 
(cytoplasmic); 
neurotrophic factor 
(secreted) 


2783 


GSN 


Gelsolin 
(amyloidosis, 
Finnish type) 


NM_000177 


NP_000168 


Hs.290070 


Cytoplasmic 
and secreted 
forms 


Controls actin filament 
assembly/disassembly 


2780 


STK39 


Serine threonine 
kinase 39 
fSTE20/SPS 1 
homolog, yeast) 


NMJH3233 


NPJ)37365 


Hs. 199263 


Cytoplasmic 
and nuclear 


Mediator of stress- 
activated signals; 
Serine/Thr Kinase, 
activated p38 


2770 


PSMB8 


Proteasome 
(prosome, 
macropain) 
subunit, beta type, 
8 (large 

multifunctional 
protease 7) 


AK092738 




Hs. 180062 


Cytoplasmic 


Processing of MHC class 
1 antigens 


2667 


LPXN 


Leupaxin 


NM_004811 


NP_004802 


Hs.49587 


Cytoplasmic 


Signal transduction 


2669 


ITK 


IL2-inducible T- 
cell kinase 


L107I7 




Hs.211576 


Cytoplasmic 


Intracellular kinase, T- 
cell proliferation and 
differentiation 


90 


KPNA6 


Karyopherin alpha 
6 (imoortin aloha 

\J III! IUUI ft 1 1 1 Uipilli 

7) 


AW021037 




Hs.301553 


Cytoplasmic 


Nucleocytoplasmic 
transport 


2794 


SH2D2A 


SH2 domain 

nrntein 2 A 


NM_003975 


NP_003966 


Hs. 103527 


Cytoplasmic 


CD8 T activation, signal 
transduction 


2765 


TNFSF5 


Tumor necrosis 
factor (ligand) 
superfamily, 
member 5 (hyper- 
IgM syndrome) 


NMJ)00074 


NP_000065 


Hs.652 


Cellular 
membrane 


B-cell proliferation, IgE 
production, 
immunoglobulin class 
switching; expressed on 
CD4+ and CD8+ T cells 


97 


CD69 


CD69 antigen 
(p60, early T-cell 
activation antigen) 


NMJ>01781 


NP_001772 


Hs.82401 


Cellular 
membrane 


Activation of 
lymphocytes, 
monocytes, and platelets 


2635 


IL2RG 


Interleukin 2 
receptor, gamma 
(severe combined 
immunodeficiency 
) 


NM 000206 


NP 000197 


Hs.84 


Cellular 
membrane 


Signalling component of 
many interleukin 
receptors 

(IL2,IL4,IL7,IL9, and 
IL15), 


96 


CXCR4 


Chemokine (C-X- 
C motif) receptor 
4 


NM_003467 


NPJ)03458 


Hs.89414 


Cellular 
membrane 


B-cell lymphopoiesis, 
leukocyte migration, 
angiogenesis; mediates 
intracellular calcium flux 


2766 


CD19 


CD 19 antigen 


NM_001770 


NP_001761 


Hs.96023 


Cellular 
membrane 


Signal transduction; B 
lymphocyte 

development, activation, 
and differentiation 
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Array 
Probe 
SEQ 
ID 


Gene 


Gene Name 


mRNA 
Accession # 


RefSeq 
Peptide 
Accession # 


Current 
UniGene 
Cluster 
(Build 156) 


Localization 


Function 


2769 


ITGBI 


Integrin, beta I 
(fibronectin 
receptor, beta 
polypeptide, 
antigen CD29 
includes MDF2, 
MSK12) 


NM_0022II 


NP_002202 


Hs.287797 


Cellular 
membrane 


Cell-cell and cell-matrix 
interactions 


2647 


TRB 


T cell receptor 
beta, constant 
region 


K02885 




Hs.300697 


Cellular 
membrane 


Antigen recognition 


82 


CTLA4 


Cytotoxic T- 
lymphocyte- 
associated protein 
4 


NM_005214 


NP_005205 


Hs.247824 


Cellular 
membrane 


Negative regulation of T 
cell activation, expressed 
by activated T cells 


83 


CD8A 


CD8 antigen, 
alpha polypeptide 
(P32) 


NM_001768 


NPJXH759 


Hs.85258 


Cellular 
membrane 


CD8 T-cell specific 
marker and class I MHC 
receptor 


114 


HLA-DRB1 


Major 

histocompatibility 
complex, class 11, 
DR beta 1 


NM_002124 


NPJ)02115 


Hs.308026 


Cellular 
membrane 


Antigen presentation 


2772 


CD3Z 


CD3Z antigen, 
zeta polypeptide 
(TiT3 complex) 


NMJ)Q0734 


NPJX)0725 


Hs.97087 


Cellular 
membrane 


T-cell marker; couples 
antigen recognition to 
several intracellular 
sign al -transduction 
pathways 


2 


ACTB 


Actin, beta 


NMJKH10I 


NP_00I092 


Hs.288061 


Cellular 
membrane 


Cell adhesion and 
recognition 


2774 


1TGAL 


Integrin, alpha L 
(antigen CD 11 A 
(Pl80), 
lymphocyte 
function- 
associated antigen 
1 ; alpha 
polypeptide) 


NMJW2209 


NP_002200 


Hs. 174103 


Cellular 
membrane 


All leukocytes; cell-cell 
adhesion, signaling 


78 


TCIRG1 


T-cell, immune 
regulator 1 , 
ATPase, H+ 
transporting, 
lysosomal V0 
protein a isoform 3 


NM_006019 


NP_006010 


Hs.46465 


Cellular 
membrane 


T cell activation 


2670 


CD72 


CD72 antigen 


NM_001782 


NPJ)0I773 


Hs. 116481 


Cellular 
membrane 


B cell proliferation 


2779 


D12S2489E 


DNA segment on 
chromosome 12 
(unique) 2489 
expressed 
sequence 


NM_007360 


NP_031386 


Hs.74085 


Cellular 
membrane 


NK cells marker 


2692 


MS4A1 


Membrane- 
spanning 4- 
domains, 
subfamily A, 
member 1 , CD20 


NM^l 52866 


NP_690605 


Hs.89751 


Cellular 
membrane 


B-cell activation, plasma 
cell development 


126 


TCRGC2 


T cell receptor 
gamma constant 2 


Ml 7323 




Hs. 11 2259 


Cellular 
membrane 




116 


CD4 


CD4 antigen (p55) 


NMJ)00616 


NP_000607 


Hs. 17483 


Cellular 
membrane 


T cell activation, signal 
transduction, T-B cell 
adhesion 
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Array 
Probe 
SEQ 
ID 


Gene 


Gene Name 


mRNA 
Accession # 


RefSeq 
Peptide 
Accession # 


Current 
UniGene 
Cluster 
(Build 156) 


Localization 


Function 


117 


CXCR3 


Chemokine (C-X- 
C motif) receptor 
3, GPR9 


NM_001504 


NPJXM495 


Hs. 198252 


Cellular 
membrane 


Integrin activation, 
cytoskeletal changes and 
chemotactic migration of 
leukocytes 


2707 


CD33 


CD33 antigen 
(gp67) 


NMJXH772 


NPJ)01763 


Hs.83731 


Cellular 
membrane 


Cell adhesion; receptor 
that inhibits the 
proliferation of normal 
and leukemic myeloid 
cells 


123 


CD47 


CD47 antigen (Rh- 
related antigen, 
integrin-associated 
signal transducer) 


NM_001777 


NP_001768 


Hs.82685 


Cellular 
membrane 


Cell adhesion, 
membrane transport, 
signaling transduction, 
permeability 


84 


BY55 


Natural killer cell 

receptor, 

immunoglobulin 

superfamily 

member 


NM_007053 


NPJ>08984 


Hs.81743 


Cellular 
membrane 


NIC cells and CTLs, 
costim with MHC I 


2784 


KLRD1 


Killer cell lectin- 
like receptor 
subfamily D, 
member 1 


NM_002262 


NP_002253 


Hs.41682 


Cellular 
membrane 


NK cell regulation 


124 


HLA-F 


Major 

histocompatibility 
complex, class I, F 


NM_0 18950 


NPJ)61823 


Hs.377850 


Cellular 
membrane 


Antigen presentation 


2752 


PTPRCAP 


Protein tyrosine 
phosphatase, 
receptor type, C- 
associated protein 


NM_005608 


NP_005599 


Hs. 155975 


Cellular 
membrane 


T cell activation 
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Table 12: Markers for CMV Infection 



PCT7US2003/012946 



New 
SEQID 


Source 


Unigene 


Acc 


Gl 


Name 


Strand 


Probe Sequence 


SAM 
FDR 


408 


cDNA 


Hs.1051 


NMJDG4131 


7262379 


granzyme B 


1 


GGAGCCAAGTCCAGATT 
TACACTGGGAGAGGTGC 
CAGCAACTGAATAAAT 


0% 


3108 


db 

mining 


Hs. 169824 


NM_002258 


4504878 


killer cell lectin- 
like receptor 


1 


TGGATCTGCCAAAAAGA 
ACTAACACCTGTGAGAA 
ATAAAGTGTATCCTGA 


0% 


3109 


cDNA 


Hs.170019 


NM_004350 


4757917 


runt-related 
transcription 
factor 3 


1 


GCTGGGTGGAAACTGCT 
TTGCACTATCGTTTGCT 
TGGTGTTTGT TTTTAA 


0% 


433 


cDNA 


Hs. 1831 25 


NM_0 16523 


7705573 


killer cell lectin- 
like receptor F 


1 


TTCCAGGCTTTTGCTAC 
TCTTCACTCAGCTACAA 
TAAACATCCTGAATGT 


0% 


3110 


db 

mining 


Hs.2014 


X06557 


37003 


T-cell receptor- 
delta 


1 


GGGGTTTATGTCCTAAC 
TGCTTTGTATGCTGTTT 
TATAAAGGGATAGAAG 


0.10% 


3111 


cDNA 


Hs.211535 


AI823649 


5444320 


EST 

IMAGE:240014 
8 


-1 


GAAGCCTTTTCTTTTCT 
GTTCACCCTCACCAAGA 
GCACAACTTAAATAGG 


0.10% 


3112 


cDNA 


Hs.301704 


AW002985 


5849991 


eomesodermin 

(Xenopus 

laevis) 


-1 


AACAAGCCATGTTTGCC 
CTAGTCCAGGATTGCCT 
CACTTGAGACTTGCTA \ 


0% 


3112 


Table 3B 


Hs.301704 


AW002985 


5849991 


eomesodermin 

(Xenopus 

laevis) 


-1 


AACAAGCCATGTTTGCC 
CTAGTCCAGGATTGCCT 
CACTTG AGACTTG CTA 


0% 


3113 


cDNA 


Hs.318885 


NM_000636 


10835186 


superoxide 
dismutase 2 


1 


TACTTTGGGGACTTGTA 
GGGATGCCTTTCTAGTC 
CTATTCTATTGCAGTT 


0.10% 


3114 


literature 


Hs.41682 


NM_007334 


7669498 


killer cell lectin- 
like receptor D 


1 


GGGCAGAGAAGGTGGAG 
AGTAAAGACCCAACATT 
ACTAACAATGATACAG 


0% 


3115 


cDNA 


Hs.71245 


AI954499 


5746809 


EST 

IMAGE:502221 


-1 


TGGTAATAGTGTTTGAC 
TCCAGGGAAGAACAGAT 
GGGTGCCAGAGTGAAA 


0% 


3116 


cDNA 


Hs.75596 


NM_000878 


4504664 


interleukin 2 
receptor, beta 


1 


ATGGAAATTGTATTTGC 
CTTCTCCACTTTGGGAG 
GCTCCCACTTCTTGGG 


0% 


436 


cDNA 


Hs.75703 


NM_002984 


4506844 


small inducible 
cytokine A4 


1 


CCACTGTCACTGTTTCT 
CTGCTGTTGCAAATACA 
TGGATAACACATTTGA 


0% 


436 


cDNA 


Hs.75703 


NM_002984 


4506844 


small inducible 
cytokine A4 


1 


CCACTGTCACTGTTTCT 
CTGCTGTTGCAAATACA 
TGGATAACACATTTGA 


0.10% 


436 


cDNA 


Hs.75703 


NM_002984 


4506844 


small inducible 
cytokine A4 


1 


GTCCACTGTCACTGTTT 
CTCTGCTGTTGCAAATA 
CATGGATAACACATTT 


0% 


436 


cDNA 


Hs.75703 


NMJ)02984 


4506844 


small inducible 
cytokine A4 


-1 


TGGTCCACTGTCACTGT 
TTCTCTGCTGTTGCAAA 
TACATGGATAACACAT 


0.10% 


415 


cDNA 


Hs.85258 


BC025715 


19344021 


CD8 antigen 


1 


CTGAGAGCCCAAACTGC 
TGTCCCAAACATGCACT 
TCCTTGCTTAAGGTAT 


0.10% 


3117 


cDNA 


Hs.111554 


AA806222 


2874972 


cDNA 196D7 


-1 


TGATTTCTGTAATGTTT 
GACCTAATAATAGCCCT 
TTTCGTCTCTGACCCA 


0% 


WBC 


N/A 


N/A 


N/A 


N/A 




N/A 


N/A 


0.10% 


WPT 


N/A 


N/A 


N/A 


N/A 




N/A 


N/A 


0% 
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